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HUMAN MATRIX METALLOPROTEASE GENE, PROTEINS ENCODED THEREFROM 
AND METHODS OF USING SAME 

Background of the Invention 

5 

1. Field of the Invention 

The present invention relates generally to therapeutic and diagnostic agents in cancer 
and inflammatory diseases. More specifically, the present invention relates to polynucleotide 
sequences encoding matrix metalloprotease proteins, as well as methods which utilize these 
1 0 sequences, which are useful for the detection, diagnosis, staging, monitoring, prognosis, 
prevention, or treatment of cancer or inflammatory diseases. 

2. Description of the Related Art 

The matrix metalloproteases (MMPs) are a multi-enzyme family capable of completely 

1 5 degrading the components of the extracellular matrix (ECM), their natural substrates (W.B. 

Ennis, and L. M. Matrisian, J. Neuro-Oncology, 18: 105- 109 (1994)). The ECM is a 
mesh work of cells and various types of collagens and proteoglycans, collectively called 
connective tissue, which provides mechanical support and helps to maintain the structural 
integrity of tissues and organs. The function of the ECM is particularly apparent in articular 

2 0 cartilage where it provides cushioning and ease of movement between bones in joints. The 

MMPs are secreted by the cellular components of the ECM (fibroblasts, chondrocytes and 
synoviocytes) and inflammatory cells (neutrophils and macrophages) in inactive forms 
(zymogens) which are converted extraceliulariy to the active enzymes by various proteinases. 
Normally MMPs function in a highly regulated fashion as part of the physiological turnover of 

2 5 the ECM, effectively renewing and remodeling the ECM. However, in the clinical features of 

several diseases, the ECM is degraded and there is much evidence to support that MMPs play a 
significant pathological role in ECM degradation. 

At least fourteen members of the MMP family have been identified and most assigned 
EC numbers; several more have been discovered recently. They can be classified generally 

3 0 according to four subgroups based on substrate preference or cellular localization; e.g., 

collagenases prefer Type I and II collagen, gelatinases prefer Type IV collagen, and 
stromelysins prefer proteoglycans (L. M. Matrisian, Bio. Essays 14: 455-463 (1992)). The 
fourth subgroup are the membrane type MMPs (miMMPs) which are characterized by the 
presence of a hydrophobic transmembrane domain near the C-terminus for anchoring the 
3 5 protein in the cell membrane. All of the other MMPs arc secreted into the extracellular milieu. 

Most of the known MMPs contain zinc in their catalytic sites and require calcium for 
activity. The major human MMPs have been cloned and exhibit greater than 50% homology. 
They contain a leader sequence for signaling their secretion by cells; a highly conserved pro- 
enzyme sequence removed upon activation; a catalytic site with a highly conserved zinc 
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binding domain; and a carboxy terminal region containing a conserved sequence similar to 
hemopexin, a heme binding protein. Although MMPs can be readily activated in vitro using 
mercurial compounds or trypsin, the precise mechanism for propeptide removal and activation 
of MMPs in vivo is not understood. Some MMPs can undergo an autoactivation process, 
5 while recent evidence indicates that membrane type MMPs may function as activators of other 
MMPs. 

A growing line of evidence implicates the MMPs as important enzymes in cancer 
metastasis. Although different cancer cell lines have been shown to express various MMPs 
when grown in culture, gelatinase A in particular has been the focus of a number of recent 
1 0 studies which demonstrates its role in the invasiveness of cancer cells (W. G. Stetler- 

Stevenson, et al, FASEBJ. 7: 1434-1441 (1993)). For example, gelatinase A is found in the 
urine of bladder cancer patients and specific monoclonal antibodies have been used to detect 
the enzyme in breast tumor sections (I.M. Margulies, et al, 1; 467-474 (1992)). 
The enzyme is expressed in an invasive prostate cancer cell line (PC-3 ML) and cells 

1 5 transfected with the gelatina.se A gene arc capable of extravasation when injected into mice (M. 

E. Stearns, and M. Wang, Oncology Res, 6: 195-201 (1994)). These studies and others 
implicate the involvement of gelatinase A in tumor metastasis, and suggest that inhibitors of 
this enzyme may offer therapeutic potential in certain forms of cancer. A broad spectrum 
matrix mctalloproteinase inhibitor has been shown to decrease the tumor burden of mice 

2 0 bearing ovarian carcinoma xenographs (B. Davies, et al., Cancer Res. 53: 2087-2091 

(1993)). This compound (BB-94, batimastat) is currently being evaluated in clinical trials for 
malignant ascites (S.A. Watson, et al.. Cancer Research, 55: 3629-3633 (1995)); however, its 
poor bioavailability necessitates parental administration. Gelatinase A selective succinyl 
hydroxamates have been suggested as anti-cancer agents as well, yet these compounds possess 

2 5 the same peptidic backbone as batimastat (Porter, J. R.; Beeley, N. R. A.; Boyce, B. A.; 

Mason, B.; Millican, A.; Millar, K.; Leonard, J.; Morphy, J. R.; O'Oonnell, J. P. Potent and 
selective inhibitors of gelatinase A L Hydroxamic acid derivatives. Bioorg. Med. Chem. Leu., 
A. 2741-2746 (1994)). More recently, an orally active, broad spectrum, MMP inhibitor (BB- 
2516, marimastat) was reported to stop progression of colorectal, ovarian, prostatic and 

3 0 pancreatic cancer (R. P. Beckett, et al., D. D. T., 1: 16-26 (1996)). 

Several lines of evidence indicate that the unregulated activity of MMPs is responsible 
for the joint degradation observed in rheumatoid arthritis and osteoarthritis. In these human 
arthritides. activated forms of the MMPs and their products (glycosaminoglycans and collagen 
fragments; are found in synovial fluids and joint tissues in abnormally high amounts (see E.D. 
3 5 Harris Jr., Role of cotlagenase in joint destruction. The Joints and Synovial Fluid, Vol., 1 , 
Sokologg, L., Ed., Orlando. FL, Academic Press, 1977, T.E. Cawston, et al.. Arthritis 
Rheum., 27: 285-290 (1984), Cawston, T. Ann. Rheumatic Diseases., 52: 769-770 (1993), 
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Y.L. Okada, et ai, J. Biol. Chem., 261: 14245-14255 (1986), D.L. Scott, et ai, Molec. 
Aspects Med., 12: 341-394 { 1991), and E.D. Harris, et ai, Arthritis Rheum., 12: 92-102 
(1992)). These arthritic tissues also show a greater-than-normal expression of MMPs (S.S. 
McCachren, Arthritis and Rheumatism , 34: 1085-1093 (1991 ), A J. P. Docherty and G. 
5 Murphy, Ann. Rheumat. Dis., 49: 469-479 ( 1990)) which is induced by cytokines and 

growth factors, also found abundantly in these tissues (D.L. Scott, et ai t (1991), supra, S.M. 
Frisch and H.E. Ruley, J. Biol. Chem., 262: 16300-16304 (1987)). In addition, the 
activities of MMPs in normal tissues are thought to be regulated by the presence of 
endogenous tissue inhibitors of MMPs, (TIMPs, see T.E. Cawston, Curr. Med. Lit. Rheum., 
1 0 3:1 27-0 ( 1 984)). The ratio of the amounts of TIMP and MMPs is thought to maintain a 

balance between the rates of degradation and synthesis of ECM. In tissues from rheumatoid 
arthritics, an abnormally high expression of MMPs results in an imbalance of these enzymes 
and degradation of ECM (S.S. McCachren ( 1991) and T.E. Cawston (1984), supra). Thus in 
arthritis, inhibition of the exacerbated degradative activities of MMPs by specific agents could 

1 5 help restore this balance. In rodent models which mimic the biochemical features of arthritis, 

there is evidence that the combined action of proteoglycan loss (due to stromelysin activity) 
and cartilage degradation (due to collagenase) are early events in this disease (R.M. Hembry, 
et al.,Am. J. Pathol, 143: 628-642 (1993), K.A. Hasty, et ai, Arthritis and Rheumatism , 
33: 388-397 (1990)). Several prototype inhibitors of MMPs have been shown to reduce 

2 0 cartilage degradation in these animal models (M.J. DiMartino, et ai, J. Cell Binchem. suppl., 

19E: 179 (1991), P. Brown, et ai, Orally active inhibitors of cartilage degradation. Abst. # 
81, Abstracts of Inflammation '93, Vienna, Austria, p. 29. (1993)). 

There is much evidence to suggest that MMPs mediate the migration of inflammatory 
cells into endothelium (D., Moscatelli and D.B. Rifkin, Biochim. Biophys. Acta., 948: 67-85 

2 5 (1988), P. Zaoui, et ai. Matrix metal loproteases (MMP) exocytosis from neutrophils is 

inhibited by endothelial adhesion. Abst. # 83, Abstracts of Inflammation '93, Vienna, Austria, 
p. 29 (1993)) participating in periodontal diseases (H. Birkedal-Hansen, J. Peridontoi, 64; 
474-484 (1993)) and facilitating the growth of atherosclerotic plaques (A. M. Henny, et ai, 
Proc. Natl. Acad. Sci., 8: 8154-8158 (1991)). Recently, gelatinasc-A was reported to 

3 0 promote cleavage of the amyloid protein precursor which would suggest a role in Alzheimers 

disease for this MMP (N. Peress, et ai, J. Neuropathol. Exp. Neurol. , 54: 16-22 (1995), 
R.N. Lipagc, et ai, FEBS Lett., 377: 267-270 (1995)). Thus, there is compelling evidence 
that MMPs play an important role in arthritis and other inflammatory diseases and th;it targeted 
inhibition of these proteinases by pharmaceutical agents could have beneficial effects. 
3 5 h would be advantageous to provide specific methods and reagents for the diagnosis, 

staging, prognosis, monitoring, prevention or treatment of diseases and conditions associated 
with imbalances in the production or activity of MMPs or to indicate possible predisposition to 
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these conditions. Such methods would include assaying a test sample for products of the 
gene. Such methods would comprise making cDNA from mRNA in the test sample, 
amplifying (when necessary) portions of the cDNA corresponding to the gene or a fragment 
thereof, and detecting the cDNA product as an indication of the presence of the cancer: or 
detecting translation products of the mRNAs comprising gene sequences as an indication of the 
presence of the disease. These reagents include polynucleotide(s), or fragment(s) thereof 
which may be used in diagnostic methods such as reverse transcriptase-polymerase chain 
reaction (RT-PCR), PCR, or hybridization assays of biopsied tissue; or proteins which are the 
translation products of such mRNAs; or antibodies directed against these proteins. Such 
assays would include methods for assaying a sample for product(s) of the gene and detecting 
the product(s) as an indication of disease. Drug treatment or gene therapy for conditions or 
diseases associated with these detected diseases and conditions then can be based on these 
identified gene sequences or their expressed proteins, and efficacy of any particular therapy 
can be monitored using the diagnostic methods disclosed herein. 

Studies to understand the role of MMPs in tumor growth, metastases, or inflammatory 
conditions and diseases such as arthritis and their potential as a therapeutic or diagnostic tools 
are limited to previously described MMP proteins. It would be advantageous to identify novel 
human MMPs which may be directly involved in the physiology of, for example, tumor 
growth, extravasation, or invasion. Isolation of DNA sequences encoding human MMPs 
would permit more extensive studies on the association and regulation of individual MMPs in 
specific cancers or tumor types, and in diseases such as arthritis or other inflammatory 
conditions. In addition, the identification of tissue-specific or disease-specific MMPs would 
provide more direct targets for therapeutics designed to attenuate these diseases. 

Summary of the Invention 

This present invention provides an isolated and purified polynucleotide encoding a 
matrix metal loprotease (provisionally named MMP 19), polynucleotide fragments thereof, 
expression vectors containing those polynucleotides, host cells transformed with those 
expression vectors, processes for making the MMP protein using those polynucleotides and 
vectors, isolated and purified MMP protein and polypeptide fragments thereof, and antibodies 
raised to synthetic peptides derived from the MMP protein. The invention also provides 
diagnostic assays to identify the presence of the MMP polynucleotide or polypeptide, assays 
used to identify agents that affect the function of the MMP polynucleotide or polypeptide, a: id 
the use as therapeutic agents of the MMP polynucleotide, polypeptides, or anybodies. 

The cDNA clone was obtained by screening a human cDNA expression database with a 
consensus sequence to twelve other human MMPs. The sequence of the partial cDNA isolated 
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indicated that the gene product is a novel MMP protein that is expressed in a limited number of 
tissues. 

Brief Description of the Drawings 

5 

FIG. 1 shows the MMP consensus sequence used to query the LifeScq™ database. 
The functional motifs present in all MMPs are double underlined. 

FIG. 2 shows the complete nucleotide and translated sequence (frame 3) of EST 
1 0 #907334 in LifeSeq™. A putative cysteine switch motif (PRCGVTD which is SEQ ED NO:8) 
and a putative furin recognition site (RKKR which is SEQ ID NO:9) are double underlined. 

FIG. 3 shows the length and position of individual sequencing reactions to generate the 
double stranded nucleotide sequence of the insert in clone #907334. The boxes at the bottom 
1 5 are graphic representations of the three forward reading frames with start codons shown as 
short upward lines and stop codons as longer downward lines. Frame 1 contains the longest 
open reading frame. 



FIG. 4 shows the doubled stranded complete nucleotide sequence (SEQ ID NO: 1 and 
2 0 SEQ ID NO: 2, top and bottom strand respectively) of the MMP 19 gene and the translated 
amino acid sequence (SEQ ID NO:9) of MMP 19 (shown. beneath in single letter codes). 
Indicated in bold type are the structural domains found in other MMPs. The cysteine switch, 
furin recognition site, and zinc binding site are double underlined. The putative poly 
adenylation signal in the 3' untranslated region of the mRNA is single underlined. Boxes 

2 5 indicate the position of the 3' end of the primers used in the synthesis of DNA constructs for 

the expression of the catalytic portion- of MMP 19. 

FIG. 5 shows the alignment of the polypeptide encoded by SEQ ID NO: 10 with the 
amino acid sequences of all known human MMPs, namely, matrilysin (GenBank Accession 

3 0 No. L22524, SEQ ID NO: 1 8), Gelatinase A (GenBank Accession No. M55593, SEQ ID 

NO: 19), Gelatinase B (GenBank Accession No. J05070, SEQ ID NO:20), Stromelysin 1 
(GenBank Accession No. J03209, SEQ ID NO:21), Stromelysin 2 (GenBank Accession No. 
X07820, SEQ ID NO:22), Stromelysin 3, (GenBank Accession No. X57766, SEQ ID 
NO:23). Coliagenase 1 (GenBank Accession No. M ! 3509. SEQ ID NO:24), Coliagenase 2 
3 5 (GenBank Accession No. J05556, SEQ ID NO:25), Coliagenase 3 (GenBank Accession No. 
X75308, SEQ ID NO:26), MMP 1 2 (GenBank Accession No. L23808, SEQ ID NO:27), 
MMP 1 8 (GenBank Accession No. Y08622 or X9252 1 , SEQ ID NO:28), mt 1 MMP (GenBank 
Accession No. D26512, SEQ ID NO:29),"mt2MMP (GenBank Accession No. Z48482, SEQ 
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ID NO:30), mGMMP (GenBank Accession No. D50477, SEQ ID NO:3i), and mt4MMP 
(GenBank Accession No. X89576, SEQ ID NO:32). The consensus sequence was generated 
using a plurality of 5. Characteristic motifs found in MMPs are double underlined. 

FIG. 6 shows the alignment and tissue distibution of ESTs found in LifeSeq™ which 
are 100% identical to regions of MMPI9 cDNA. 

FIG. 7 shows a Phospholmage of a Northern blot of human RNA (2jig of poly A + 
RNA/!ane) probed with a 32 P-labeIed MMP19 cDNA probe. 

Detailed Description of the Invention 

The present invention provides isolated and purified polynucleotides that encode a 
human matrix metalloprotease, fragments thereof, expression vectors containing those 
polynucleotides, host cells transformed with those expression vectors, a process of making the 
MMP using those polynucleotides and vectors, and isolated and purified recombinant MMP 
and polypeptide fragments thereof. 

The present invention also provides for the use of MMP 19 in developing treatments for 
any disorder mediated (directly or indirectly) by insufficient amounts or production of MMP 19 
protein. Purified human MMP19 protein may be administered to a patient with such a 
condition. Alternatively, gene therapy techniques for producing MMPI9 polypeptide in vivo 
are also provided. 

The present invention also provides methods for assaying a test sample for products of 
the MMP gene, which comprises making cDNA from mRNA in the test sample, and detecting 
the cDNA as an indication of the presence of the MMP gene. The method may include an 
amplification step, wherein portions of the cDNA corresponding to the gene or fragment 
thereof is amplified. Methods also are provided for assaying for the translation products of 
mRNAs. Test samples which may be assayed by the methods provided herein include tissues, 
cells, body fluids and secretions. The present invention also provides reagents such as 
oligonucleotide primers and polypeptides which are useful in performing these methods. 

Portions of the nucleic acid sequences disclosed herein arc useful as primers for the 
reverse transcription of RNA or for the amplification of cDNA; or as probes to determine the 
presence of certain cDNA sequences in test samples. Also disclosed are nucleic acid 
sequences which permit the production of encoded polypeptide sequences which are useful as 
standards or reagents in viiagnostic immunoassays, targets for pharmaceutical screening assays 
and/or as components or target sites for various therapies. Monoclonal and polyclonal 
antibodies directed against at least one epitope contained within these polypeptide sequences 
are useful for diagnostic tests as well as delivery agents for therapeutic agents and for" 



WO 98/40475 



PCT/US98/04694 



7 

screening for diseases or conditions associated with cancers, arthritis, or inflammation. 
Isolation of sequences of other portions of the gene of interest can be accomplished by utilizing 
probes or PCR primers derived from these nucleic acid sequences, thus allowing additional 
probes and polypeptides of the genome of interest to be established, which also will be useful 
in the diagnosis, prognosis and/or treatment of diseases and conditions characterized by the 
MMP gene disclosed herein. 

The techniques for determining the amino acid sequence "similarity" are well-known in 
the art. In general, "similarity" means the exact amino acid to amino acid comparison of two 
or more polypeptides at the appropriate place, where amino acids are identical or possess 
similar chemical and/or physical properties such as charge or hydrophobicity. A so-termed 
"percent similarity" then can be determined between the compared polypeptide sequences. The 
techniques for determining nucleic acid and amino acid sequence identity also are well known 
in the art and include determining the nucleotide sequence of the mRNA for that gene (usually 
via a cDNA intermediate) and determining the amino acid sequence encoded therein, and 
comparing this to a second amino acid sequence. In general, "identity" refers to an exact 
nucleotide to nucleotide or amino acid to amino acid correspondence of two polynucleotides or 
polypeptide sequences, respectively. Two or more polynucleotide sequences can be compared 
by determining their "percent identity." Two amino acid sequences likewise can be compared 
by determining their "percent identity." The programs available in the Wisconsin Sequence 
Analysis Package (available from Genetics Computer Group, Madison, WI), for example, the 
GAP program, are capable of calculating both the identity between two polynucleotides and the 
identity and similarity between two polypeptide sequences, respectively. Other programs for 
calculating identity or similarity between sequences are known in the art. 

The compositions and methods described herein will enable the identification of certain 
markers as indicative of cancer, arthritis, or inflammation resulting from or associated with an 
MMP 19 disorder; the information obtained therefrom will aid in the diagnosis, staging,, 
monitoring, prognosis and/or therapy of those same diseases or conditions. Test methods 
include, for example, probe assays which utilize the sequence(s) provided herein and which 
also may utilize nucleic acid amplification methods such as the polymerase chain reaction 
(PCR), the ligase chain reaction (LCR); and hybridization. In addition, the nucleotide 
sequences provided herein contain open reading frames from which one or more immunogenic 
epitopes may be found. Such epitopes are believed to be unique to the disease state or 
condition associated with MMP 19 related cancer, arthritis, or inflammation. The uniqueness 
of any epitope may be determined by its immunological reactivity with the specific MMP gene 
in diseased tissues and lack of immunological reactivity with non-diseased tissues. Methods 
for determining immunological reactivity are well-known and include but are not limited to, for 
example, radioimmunoassay (R1A), enzyme-linked immunosorbent assay (ELISA), 
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hemagglutination (HA), fluorescence polarization immunoassay (FPIA); chemilurrunescent 
immunoassay (CLIA), and others; several examples of suitable methods are described herein. 
Unless otherwise stated, the following terms shall have the following meanings: 
A polynucleotide "derived from" a designated sequence refers to a polynucleotide 
5 sequence which is comprised of a sequence of approximately at least about 6 nucleotides, is 

preferably at least about 8 nucleotides, is more preferably at least about 10-12 nucleotides, and 
even more preferably is at least about 15-20 nucleotides corresponding, i.e., identical to or 
complementary to, a region of the designated nucleotide sequence. The sequence may be 
complementary to or identical to a sequence which is unique to a particular polynucleotide 
1 0 sequence as determined by techniques known in the art. Comparisions to sequences in 

databanks, for example, can be used as a method to determine the uniqueness of a designated 
sequence. Regions from which sequences may be derived include but are not limited to 
regions encoding specific epitopes, as well as non-translated and/or non-transcribed regions. 
The derived polynucleotide will not necessarily be derived physically from the 

1 5 nucleotide sequence of interest under study, but may be generated in any manner, including 

but not limited to chemical synthesis, replication, reverse transcription or transcription, which 
is based on the information provided by the sequence of bases in the region(s) from which the 
polynucleotide is derived; as such, it may represent either a sense or an antisense orientation of 
the original polynucleotide. In addition, combinations of regions corresponding to that of the 

2 0 designated sequence may be modified in ways known in the art to be consistent with an 

intended use. 

The term "primer*' denotes a specific oligonucleotide sequence complementary to a 
target nucleotide sequence and used to hybridize to the target nucleotide sequence and serve as 
an initiation point for nucleotide polymerization catalyzed by either DNA polymerase, RN A 

2 5 polymerase or reverse transcriptase. 

The term "probe" denotes a defined nucleic acid segment (or nucleotide analog 
segment, i.e., PNA) which can be used to identify specific DNA or RNA present in samples 
bearing the complementary sequence. 

A "polypeptide" or "amino acid" sequence derived from a designated nucleic acid 

3 0 sequence refers to a polypeptide having an amino acid sequence identical to that of a 

polypeptide encoded in the sequence or a portion thereof wherein the portion consists of at 
least 3 to 5 contiguous amino acids, and more preferably at least 8 to 10 contiguous amino 
acids, and even more preferably 15 to 20 continguous amino acids, or which is 
immunologically identifiable with a polypeptide encoded in the sequence. 
3 5 A "recombinant polypeptide" as used herein means at least a polypeptide which by 

virtue of its origin or manipulation is not associated with all or a portion of the polypeptide 
with which it is associated in nature and/or is linked to a polypeptide other than that to which it 
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is linked in nature. A recombinant or derived polypeptide is not necessarily translated from a 
designated nucleic acid sequence. It also may be generated in any manner, including chemical - 
synthesis or expression of a recombinant expression system. 

The term "synthetic peptide" as used herein means a polymeric form of amino acids of 
5 any length, which may be chemically synthesized by methods well-known to the routineer. 
These synthetic peptides are useful in various applications. 

The term "polynucleotide" as used herein means a polymeric form of nucleotides of . 
any length, either ribonucleotides or deoxyribonucleotides. This term refers only to the 
primary structure of the molecule. Thus, the term includes double- and single-stranded DNA, 
1 0 as well as. double- and single-stranded RNA. It also includes modifications, such as 
methylation or capping, and unmodified forms of the polynucleotide. 

'"A sequence corresponding to a cDNA" means that the sequence contains a 
polynucleotide sequence that is identical to or complementary to a sequence in the designated 
DNA. The degree (or "percent") of identity or complementarity to the cDNA will be 

1 5 approximately 50% or greater, will preferably be at least about 70% or greater, and more 

preferably will be at least about 90%. The sequence that corresponds will be at least about 50 
nucleotides in length, will preferably be about 60 nucleotides in length, and more preferably, 
will be at least about 70 nucleotides in length. The correspondence between the gene or gene 
fragment of interest and the cDNA can be determined by methods known in the art, and 

2 0 include, for example, a direct comparison of the sequenced material with the cDNAs 

described, or hybridization and digestion with single strand nucleases, followed by size 
determination of the digested fragments. 

"Purified polynucleotide" refers to a polynucleotide of interest or fragment thereof 
which is essentially free, i.e., contains less than about 50%, preferably less than about 70%, 

2 5 and more preferably, less than about 90% of the protein with which the polynucleotide is 

naturally associated. Techniques for purifying polynucleotides of interest are well-known in 
the art and include, for example, disruption of the cell containing the polynucleotide with a 
chaotropic agent and separation of the polynucleotide(s) and proteins by ion-exchange 
chromatography, affinity chromatography and sedimentation according to density. Thus, 

3 0 "purified polypeptide" means a polypeptide of interest or fragment thereof which is essentially 

free, that is, contains less than about 50%, preferably less than about 70%, and more 
preferably, less than about 90% of cellular components with which the polypeptide of interest 
is naturally associated. Methods for purifying are known in the art. 

The term "isolated" means that the material is removed from its original environment 
3 5 (e.g., the natural environment if it is naturally occurring). For example, a naturally-occurring 
polynucleotide or polypeptide present in a living animal is not isolated, but the same 
polynucleotide or DNA or polypeptide, which is separated from some or ail of the coexisting 
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materials in the natural system, is isolated. Such polynucleotide could be part of a vector 
and/or such polynucleotide or polypeptide couid be part of a composition, and still be isolated 
in that the vector or composition is not part of its natural environment. 

"Polypeptide" as used herein indicates a molecular chain of amino acids and does not 
5 refer to a specific length of the product. Thus, peptides, oligopeptides and proteins are 

included within the definition of polypeptide. This term, however, is not intended to refer to 
post-expression modifications of the polypeptide, for example, glycosylations, acetylations, 
phosphorylations and the like. 

"Recombinant host cells," "host cells," "ceils," "cell lines," "cell cultures," and other 
1 0 such terms denoting microorganisms or higher eukaryotic cell lines cultured as unicellular 

entities refer to cells which can be, or have been, used as recipients for recombinant vector or 
other transferred DNA, and include the original progeny of the original cell which has been 
transfected. 

As used herein "replicon" means any genetic element, such as a plasmid, a 

1 5 chromosome or a virus, that behaves as an autonomous unit of polynucleotide replication 

within a cell. 

A "vector" is a replicon in which another polynucleotide segment is attached, such as to 
bring about the replication and/or expression of the attached segment. 

The term "control sequence" refers to polynucleotide sequences which are necessary to 

2 0 effect the expression of coding sequences to which they arc ligatcd. The nature of such control 

sequences differs depending upon the host organism. In prokaryotes, such control sequences 
generally include promoter, ribosomal binding site and terminators; in eukaryotes, such control 
sequences generally include promoters, terminators and, in some instances, enhancers. The 
term "control sequence" thus is intended to include at a minimum all components whose 

2 .5 presence is necessary for expression, and also may include additional components whose 

presence is advantageous, for example, leader sequences. 

"Operably linked" refers to a situation wherein the components described arc in a 
relationship permitting them to function in their intended manner. Thus, for example, a control 
sequence "operably linked" to a coding sequence is ligated in such a manner that expression of 

3 0 the coding sequence is achieved under conditions compatible with the control sequences. 

The term "open reading frame" or "ORF" refers to a region of a polynucleotide 
sequence which encodes a polypeptide; this region may represent a portion of a coding 
sequence or a total coding sequence. 

A "coding sequence" is a polynucleotide sequence which is transcribed into mRNA and 
3 5 translated into a polypeptide when placed under the control of appropriate regulatory 

sequences. The boundaries-of the coding sequence are determined by a translation start codon 
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at the 5' -terminus and a translation stop codon at the 3' -terminus. A coding sequence can 
include, but is not limited to, mRNA, cDNA, and recombinant polynucleotide sequences. 

The term "immunologically identifiable with/as" refers to the presence of epitope(s) and 
polypeptide(s) which also are present in and are unique to the designated polypeptide(s). 
5 Immunological identity may be determined by antibody binding and/or competition in binding. 
These techniques are known to the routineer and also are described herein. The uniqueness of 
an epitope also can be determined by computer searches of known data banks, such as 
GenBank, for the polynucleotide sequences which encode the epitope, and by amino acid 
sequence comparisons with other known proteins. 
1 0 As used herein, "epitope" means an antigenic determinant of a polypeptide. 

Conceivably, an epitope can comprise three amino acids in a spatial conformation which is 
unique to the epitope. Generally, an epitope consists of at least five such amino acids, and 
more usually, it consists of at least eight to ten amino acids. Methods of examining spatial 
conformation are known in the art and include, for example, x-ray crystallography and two- 

1 5 dimensional nuclear magnetic resonance. 

A "conformational epitope" is an epitope that is comprised of specific juxtaposition of 
amino acids in an immunologically recognizable structure, such amino acids being present on 
the same polypeptide in a contiguous or non-contiguous order or present on different 
polypeptides. 

2 0 A polypeptide is "immunologically reactive" with an antibody when it binds to an 

antibody due to antibody recognition of a specific epitope contained within the polypeptide. 
Immunological reactivity may be determined by antibody binding, more particularly by the 
kinetics of antibody binding, and/or by competition in binding using as competitors) a known 
polypeptide(s) containing an epitope against which the antibody is directed. The methods for 

2 5 determining whether a polypeptide is immunologically reactive with an antibody are known in 

the art. 

As used herein, the term "immunogenic polypeptide containing an epitope of interest" 
means naturally occurring polypeptides of interest or fragments thereof, as well as 
polypeptides prepared by other means, for example, by chemical synthesis or the expression 

3 0 of the polypeptide in a recombinant organism. 

The term "transformation" refers to the insertion of an exogenous polynucleotide into a 
host celljrrespective of the method used for the insertion. For example, direct uptake, 
transduction or f-mating are included. The exogenous polynucleotide may be maintained as 
non-integrated vector, for example, a plasmid, or alternatively, may be integrated into the ho.si 
3 5 genome. 

"Treatment" refers to prophylaxis and/or therapy. 
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The term "individual" as used herein refers to vertebrates, particularly members of the 
mammalian species and includes but is not limited to domestic animals, sports animals, 
primates and humans; more particularly the term refers to humans. 

The term "sense strand" or "plus strand" (or 4 V) as used herein denotes a nucleic acid 
5 that contains the sequence that encodes the polypeptide. The term "antisense strand" or "minus 
strand" (or "-") denotes a nucleic acid that contains a sequence that is complementary to that of 
the "plus" strand. 

The term "test sample" refers to a component of an individual's body which is the 
source of the analyte (such as, antibodies of interest or antigens of interest). These 
1 0 components are well known in the art. These test samples include biological samples which 
can be tested by the methods of the present invention described herein and include human and 
animal body fluids such as whole blood, serum, plasma, cerebrospinal fluid, urine, lymph 
fluids, and various external secretions of the respiratory, intestinal and genitorurinary tracts, 
tears, saliva, milk, white blood cells, myelomas and the like: biological fluids such as cell 

1 5 culture supernatants; fixed tissue specimens; arid fixed cell specimens. 

• "Purified product" refers to a preparation of the product which has been isolated from 
the cellular constituents with which the product is normally associated, and from other types of 
cells which may be present in the sample of interest. 

"PNA" denotes a "peptide nucleic acid analog" which may be utilized in a procedure 

2 0 such as an assay described herein to determine the presence of a target. "MA" denotes a 

"morpholino analog" which may be utilized in a procedure, such as an assay described herein to 
determine the presence of a target. See, for example, U.S. Patent No. 5,378,84 1, which is 
incorporated herein by reference. PNAs are neutrally charged moieties which can be directed 
against RNA targets or DNA. PNA probes used in assays in place of, for example, the DN A 

2 5 probes of the present invention, offer advantages not achievable when DNA probes are used. 

These advantages include manufacturability, large scale labeling, reproducibility, stability, 
insensitivity to changes in ionic strength and resistance to enzymatic degradation which is 
present in methods utilizing DNA or RNA. These PNAs can be labeled with such signal 
generating compounds as fluorescein, radionucleotides, chemiluminescent compounds, and 

3 0 the like. PNAs or other nucleic acid analogs such as MAs thus can be used in assay methods 

in place of DNA or RNA. Although assays are described herein utilizing DNA probes, il is 
within the scope of the routineer that PNAs or MAs can be substituted for RNA or DNA with 
appropriate changes if and as needed in assay reagents. 

"Analyte," as used herein, is the substance to be detected which may be present in the 
3 5 test sample. The analyte can be any substance for which there exists a naturally occurring 

specific binding member (such as, an antibody), or for which a specific binding member can 
be prepared. Thus, an analyte is a substance that can bind to one or more specific binding 
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members in an assay. "Analyte" also includes any antigenic substances, haptens, antibodies, 
and combinations thereof. As a member of a specific binding pair, the analyte can be detected 
by means of naturally occurring specific binding partners (pairs) such as the use of intrinsic 
factor protein as a member of a specific binding pair for the determination of Vitamin B 1 2, the 
5 use of folate-binding protein to determine folic acid, or the use of a lectin as a member of a 
specific binding pair for the determination of a carbohydrate. The analyte can include a 
protein, a peptide, an amino acid, a nucleotide target, and the like. 

An "Expressed Sequence Tag" or "EST" refers to the partial sequence of a cDNA insert 
which has been made by reverse transcription of mRNA extracted from a tissue, followed by 
I 0 insertion into a vector. 

A "transcript image" refers to a table or list giving the quantitative distribution of ESTs. 
in a library and represents the genes active in the tissue from which the library was made. 

The present invention provides assays which utilize specific binding members. A 
"specific binding member," as used herein, is a member of a specific binding pair. That is. 

1 5 two different molecules where one of the molecules through chemical or physical means 

specifically binds to the second molecule. Therefore, in addition to antigen and antibody 
specific binding pairs of common immunoassays, other specific binding pairs can include 
biotin and avidin, carbohydrates and lectins, complementary nucleotide sequences, effector 
and receptor molecules, cofactors and enzymes, enzyme inhibitors and enzymes, and the like. 

2 0 Furthermore, specific binding pairs can include members that are analogs of the original 

specific binding members, for example, an analyte-analog. Immunoreactive specific binding 
members include antigens, antigen fragments, antibodies and antibody fragments, both 
monoclonal and polyclonal, and complexes thereof, including those formed by recombinant 
DNA molecules. 

2 5 The term "hapten," as used herein, refers to a partial antigen or non-protein binding 

member which is capable of binding to an antibody, but which is not capable of eliciting 
antibody formation unless coupled to a carrier protein. 

A "capture reagent," as used herein, refers to an unlabeled specific binding member 
which is specific either for the analyte as in a sandwich assay, for the indicator reagent or 

3 0 analyte as in a competitive assay, or for an ancillary specific binding member, which itself is 

specific for the analyte, as in an indirect assay. The capture reagent can be directly or 
indirectly bound to a solid phase material before the performance of the assay or during the 
performance of the assay, thereby enabling the separation of immobilized complexes from the 
test sample. 

3 5 The "indicator reagent" comprises a "signal-generating compound" ("label") which is 

capable of generating and generates a measurable signal detectable by external means, 
conjugated ("attached") to a specific binding member. "Specific binding member" as used 
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herein means a member of a specific binding pair. That is, two different molecules where one 
of the molecules through chemical or physical means specifically binds to the second molecule. 
In addition to being an antibody member of a specific binding pair, the indicator reagent also 
can be a member of any specific binding pair, including either hapten-anti-hapten systems such 
5 as biotin or anti-biotin, avidin or biotin, a carbohydrate or a lectin, a complementary nucleotide 
sequence, an effector or a receptor molecule, an enzyme cefaclor and an enzyme, an enzyme 
inhibitor or an enzyme, and the like. An immunoreactive specific binding member can be an 
antibody, an antigen, or an antibody/antigen complex that is capable of binding either to 
polypeptide of interest as in a sandwich assay, to the capture reagent as in a competitive assay, 

10 or to the ancillary specific binding member as in an indirect assay. When describing probes 

and probe assays, the term "reporter molecule" may be used. A reporter molecule comprises a 
signal generating compound as described hereinabove conjugated to a specific binding member 
of a specific binding pair, such as carbazol or adamantane. 

The various "signal-generating compounds" (labels) contemplated include 

15 chromogens, catalysts such as enzymes, luminescent compounds such as fluorescein and 

rhodamine, chemiluminescent compounds such as dioxetanes, acridiniums, phenanthridiniums 
and luminol, radioactive elements, and direct visual labels. Examples of enzymes include 
alkaline phosphatase, horseradish peroxidase, beta-galactosidase, and the like. The selection of 
a particular label is not critical, but it will be capable of producing a signal either by itself or in 

2 0 conjunction with one or more additional substances. 

"Solid phases" ("solid supports") arc known to those in the art and include the walls of 
wells of a reaction tray, test tubes, polystyrene beads, magnetic beads, nitrocellulose strips, 
membranes, microparticles such as latex particles, sheep (or other animal) red blood cells, and 
Duracytes® (red blood cells "fixed" by pyruvic aldehyde and formaldehyde, available from 

2 5 Abbott Laboratories, Abbott Park, 1L) and others. The "solid phase" is not critical and can be 

selected by one skilled in the art. Thus, latex particles, microparticles, magnetic or. non- 
magnetic beads, membranes, plastic tubes, walls of microtiter wells, glass or silicon chips, 
sheep (or other suitable animal's) red blood cells and Duracytes® are all suitable examples. 
Suitable methods for immobilizing peptides on solid phases include ionic, hydrophobic, 

3 0 covalent interactions and the like. A "solid phase", as used herein, refers to any material 

which is insoluble, or can be made insoluble by a subsequent reaction. The solid phase can be 
chosen for its intrinsic ability to attract and immobilize the capture reagent. Alternatively, the 
solid phase can retain an additional receptor which has the ability to attract and immobilize the 
capture reagent. The additional receptor can include a charged substance that is oppositely 
3 5 charged with respect to the capture reagent itself or to a charged substance conjugated to the 
capture reagent. As yet another alternative, the receptor molecule can be any specific binding 
member which is immobilized upon (attached to) the solid phase and which has the ability to 
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immobilize the capture reagent through a specific binding reaction. The receptor molecule 
enables the indirect binding of the capture reagent to a solid phase material before the 
performance of the assay or during the performance of the assay. The solid phase thus can be 
a plastic, derivatized plastic, magnetic or non-magnetic metal, glass or silicon surface of a test 
5 lube, microliter well, sheet, bead, microparticle, chip, sheep (or other suitable animal's) red 
blood cells, Duracytes® and other configurations known to those of ordinary skill in the art. 

It is contemplated and within the scope of the present invention that the solid phase also 
can comprise any suitable porous material with sufficient porosity to allow access by detection 
antibodies and a suitable surface affinity to bind antigens. Microporous structure generally arc 

1 0 ^preferred, but materials with gel structure in the hydrated state may be used as well. Such 

usefuf solid supports include but are not limited to nitrocellulose and nylon. It is contemplated 
that such porous solid supports described herein preferably are in the form of sheets of 
thickness from about 0.01 to 0.5 mm, preferably about 0.1mm. The pore size may vary 
within wide limits, and preferably is from about 0.025 to 15 microns, especially from about 
15 0. 15 to 1 5 microns. The surface of such supports may be activated by chemical processes 
which cause covalent linkage of the antigen or antibody to the support. The irreversible 
binding of the antigen or antibody is obtained, however, in general, by adsorption on the 
porous material by poorly understood hydrophobic forces. Other suitable solid supports are 
known in the art. 

2 0 Reagents . 

The present invention provides reagents such as human MMP19 polynucleotide 
sequences derived from an MMP19 of interest, polypeptides encoded therein, and antibodies 
developed from these polypeptides. The present invention also provides reagents such as 
oligonucleotide fragments derived from the disclosed polynucleotides and nucleic acid 

2 5 sequences complementary to the these polynucleotides. The polynucleotides or polypeptides 

or antibodies of the present invention may be used in the diagnosis, prognosis, and/or 
treatment of individuals with conditions associated with the expression of the MMP19 gene, 
' such as cancer, arthritis, or inflammation, or to identify a predisposition to these conditions. 
The sequences disclosed herein represent unique polynucleotides which can be used in assays 

3 0 or for producing a disease specific profile of gene transcription activity. 

Selected MMP19 polynucleotides can be used in the methods described herein for the 
detection of normal or altered gene expression. Such methods may employ the MMP19 
polynucleotides disclosed herein. or oligonucleotides, fragments or derivatives thereof, or 
nucleic acid sequences complementary to these polynucleotides. 
3 5 The polynucleotides disclosed herein, their complementary sequences or fragments of 

either can be used in assays to detect, amplify or quantify genes, cDNAs.or mRNAs relating to 
cancer, arthritis, or inflammation and associated conditions. They also can be used to identify 
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an entire or partial coding region which encodes for MMP19 polypeptide. "They further can be 
provided in individual containers in the form of a kit for assays, or provided as individual 
compositions. If provided in a kit for assays, other suitable reagents such as buffers, 
conjugates and the like may be included. 
5 The polynucleotide(s) may be in the form of mRNA or DNA. Polynucleotides in the 

form of DNA, cDN A, genomic DNA, and synthetic DNA are within the scope of the present 
invention. The DNA may be double-stranded or single-stranded, and if single stranded may 
be the coding (sense) strand or non-coding (anti-sense) strand. The coding sequence which 
encodes the polypeptide may be identical lo the coding sequence provided' herein or may be a 
1 0 different coding sequence which coding sequence, as a result of the redundancy or degeneracy 
of the genetic code, encodes the same polypeptide as the DNA provided herein. 

This polynucleotide may include only the coding sequence for the polypeptide, or the 
coding sequence for the polypeptide and additional coding sequence such as a leader or 
secretory sequence or a proprotein sequence, or the coding sequence for the polypeptide (and 

1 5 optionally additional coding sequence) and non-coding sequence, such as a non-coding 

sequence 5' and/or 3' of the coding sequence for the polypeptide. 

In addition, the invention includes variant polynucleotides containing modifications 
such as polynucleotide deletions, substitutions or additions: and any polypeptide modification 
resulting from the variant polynucleotide sequence. A polynucleotide of the present invention 

2 0 also may have a coding sequence which is a naturally occurring allelic variant of the coding 

sequence provided herein. 

In addition, the coding sequence for the polypeptide may be fused in the same reading 
frame to a polynucleotide sequence which aids in expression and secretion of a polypeptide 
from a host cell, for example, a leader sequence which functions as a secretory sequence for 

2 5 controlling transport of a polypeptide from the cell. The polypeptide having a leader sequence 

is a preprotein and may have the leader sequence cleaved by the host cell to form the 
polypeptide. The polynucleotides may also encode for a proprotein which is the protein plus 
additional amino acid residues at the N-terminus. A protein having a prosequence is a 
proprotein and may in some cases be an inactive form of the protein. Once the prosequence is 

3 0 cleaved an active protein remains. Thus, the polynucleotide of the present invention may 

encode for a protein, or for a protein having a prosequence or for a protein having both a 
presequence (leader sequence) and a prosequence. 

The polynucleotides of the present invention may also have the coding sequence fused 
in frame to a marker sequence which allows for purification of the polypeptide of the present 
3 5 invention. The marker sequence may be a hexa-histidine tag supplied by a pQE-9 vector to 
provide for purification of the polypeptide fused to the marker in the case of a bacterial host, 
or, for example, the marker sequence may be a hemagglutinin (HA) tag when a mammalian 
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host, e.g. COS-7 cells, is used. The HA tag corresponds to an epitope derived from the 
influenza hemagglutinin protein. See, for example, I. Wilson, et a!., Cell 37:767 (1984). 

It is contemplated that polynucleotides will be considered to hybridize to the sequences 
provided herein if there is at least 50%, and preferably at least 70% identity between the 
polynucleotide and the sequence. 

Probes constructed according to the polynucleotide sequences of the present invention 
can be used in various assay methods to provide various types of analysis. For example, such 
probes can be used in Fluorescent In Situ Hybridization (FISH) technology to perform 
. chromosomal analysis, and used to identify cancer-specific structural alterations in the 
chromosomes, such as deletions or translocations that are visible from chromosome spreads or 
detectable using PCR-generated and/or allele specific oligonucleotides probes, allele specific 
amplification or by direct sequencing. Probes also can be labeled with radioisotopes, direct- 
or indirectly- detectable haptens, or fluorescent molecules, and utilized for in_sjtu hybridization 
studies to evaluate the mRNA expression of the gene comprising the polynucleotide in fixed 
tissue specimens or cells. 

The present invention further relates to an MMP19 polypeptide which has the deduced 
amino acid sequence as provided herein, as well as fragments, analogs and derivatives of such 
polypeptide. The polypeptide of the present invention may be a recombinant polypeptide, a 
natural purified polypeptide or a synthetic polypeptide. The fragment, derivative or analog of 
the MMP19 polypeptide may be one in which one or more of the amino acid residues is 
substituted with a conserved or non-conserved amino acid residue (preferably a conserved 
amino acid residue) and such substituted amino acid residue may or may not be one encoded 
by the genetic code; or it may be one in which one or more of the amino acid residues includes 
a substitucnt group; or it may be one in which the polypeptide is fused with another 
compound, such as a compound to increase the half-life of the polypeptide (for example, 
polyethylene glycol); or it may be one in which the additional amino acids are fused to the. 
polypeptide, such as a leader or secretory sequence or a sequence which is employed for 
purification of the polypeptide or a proprotein sequence. Such fragments, derivatives and 
analogs are within the scope of the present invention. The polypeptides and polynucleotides of 
the present invention are preferably provided in an isolated form, and preferably purified. 

■ Thus, a polypeptide of the present invention may have ah amino acid sequence that is 
identical to that of the naturally occurring polypeptide or that is different by minor variations 
due to one or more amino acid substitutions. The variation may be a "conservative change" 
typically in the range of about 1 to 5 amino acids, wherein the substituted amino acid has 
similar structural or chemical properties, eg, replacement of leucine with isoleucine or 
threonine with serine. In contrast, variations may include nonconservative changes, eg, 
• replacement of a glycine with a tryptophan. Similar minor variations may also include amino 
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acid deletions or insertions, or both. Guidance in determining which and how many amino 
acid residues may be substituted, inserted or deleted without changing biological or 
immunological activity may be found using computer programs well known in the art, for 
example, DNASTAR software (DNASTAR Inc., Madison WI). 
5 Preferably, a polypeptide of the present invention is one having at least 40% identity, 

more preferably, at least 50% identity, even more preferably at least 60% identity and even 
more preferably, at least 70% identity to SEQ ID NO: 10. Such a polypeptide may also 
possess the ability to cleave at least the substrate described as SEQ ID NO: 13. 

The present invention also provides an antibody produced by using a purified MMP19 
1 0 polypeptide of which at least a portion of the polypeptide is encoded by MMP19 

polynucleotide selected from the polynucleotides provided herein. These antibodies may be 
used in the methods provided herein for the detection of MMP 19 polypeptides in test samples. 
The antibody also may be used for therapeutic purposes, for example, in neutralizing the 
activity of a MMP 19 polypeptide in conditions associated with altered or abnormal expression 

1 5 oftheMMP19gene. 

This invention also provides teachings as to the production of the polynucleotides and 
polypeptides provided herein. 
Probe Assays 

The sequences provided herein may be used to produce probes which can be used in 

2 0 assays for the detection of nucleic acids in test samples. The probes may be designed from 

conserved nucleotide regions of the polynucleotides of interest or from non-conserved 
nucleotide regions of the polynucleotide of interest. The design of such probes for 
optimization in assays is within the skill of the routineer. Generally, nucleic acid probes are 
developed from non-conserved or unique regions when maximum specificity is desired, and 

2 5 nucleic acid probes are developed from conserved regions when assaying for nucleotide 

regions that are closely related to, for example, different members of a multigene family or in 
related species like mouse and man. 

The polymerase chain reaction (PCR) is a technique for amplifying a desired nucleic 
acid sequence (target) contained in a nucleic acid or mixture thereof. In PCR, a pair of primers 

3 0 are employed in excess to hybridize at the outside ends of complementary strands of the target 

nucleic acid. The primers are each extended by a polymerase using the target nucleic acid as a 
template. The extension products become target sequences themselves, following dissociation 
from the original target strand. New primers then are hybridized and extended by a 
polymerase, and the cycle is repeated to geometrical i\ increase the number of target sequence 
3 5 molecules. PCR is disclosed in U.S. patents 4,683,195 and 4,683,202, which are 
incorporated herein by reference. 
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The Ligase Chain Reaction (LCR) is an alternate method for nucleic acid amplification. 
In LCR, probe pairs are used which include two primary (first and second) and two secondary 
(third and fourth) probes, all of which are employed in molar excess to target. The first probe 
hybridizes to a first segment of the target strand and the second probe hybridizes to a second 
5 segment of the target strand, the first and second segments being contiguous so that the 

primary probes abut one another in 5' phosphate-3' hydroxy 1 relationship, and so that a ligase 
can covalently fuse or ligate the two probes into a fused product. In addition, a third 
(secondary) probe can hybridize to a portion of the first probe and a fourth (secondary) probe 
can hybridize to a portion of the second probe in a similar abutting fashion. Of course, if the 
1 0 target is initially double stranded, the secondary probes also will hybridize to the target 

complement in the first instance. Once the ligated strand of primary probes is separated from 
the target strand, it will hybridize with the third and fourth probes which can be ligated to form 
a complementary, secondary ligated product. It is important to realize that the ligated products 
are functionally equivalent to either the target or its complement. By repeated cycles of 

1 5 hybridization and ligation, amplification of the target sequence is achieved. This technique is 

described more completely in EP-A- 320 308 to K. Backman published June 16, 1989 and 
EP-A-439 1 82 to K. Backman et al, published July 3 1 , 1991 , both of which are incorporated 
herein by reference. 

For amplification of mRNAs, it is within the scope of the present invention to reverse 

2 0 transcribe mRNA into cDNA followed by polymerase chain reaction (RT-PCR); or, to use a 

single enzyme for both steps as described in U.S. Patent No. 5,322,770, which is 
incorporated herein by reference; or reverse transcribe mRNA into cDNA followed by 
asymmetric gap ligase chain reaction (RT-AGLCR) as described by R. L. Marshall, ct al., 
PCR Methods and Amplications 4: 80-84.(1994), which also is incorporated herein by 

2 5 reference. 

Other known amplification methods which can be utilized herein include but are not 
limited to the so-called "NASBA" or "3SR" technique described in PNAS USA 87: 1 874- 1 878 - 
( 1 990) and also described in Nature 350 (No. 63 1 3):9 1 -92 ( 1 99 1 ) ; Q-beta amplification as 
described in published European Patent Application (EPA) No. 4544610; strand displacement 

3 0 amplification (as described in G. T. Walker et al., Clip. Chem . 42:9-13 [1 996]) and European 

Patent Application No. 684315; and target mediated amplication, as described by PCT 

Publication WO 9322461. 

In one embodiment, the present invention generally comprises the steps of contacting a 
test sample suspected of containing a target polynucleosis sequence with amplification 
3 5 reaction reagents comprising an amplification primer, and a detection probe that can hybrid.ze 
with an internal region of the amplicon sequences. Probes and primers employed according to 
the method herein provided are labeled with capture and detection labels wherein probes are 
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labeled with one type of label and primers are labeled with the other type of label. 
Additionally, the primers and probes are selected such that the probe sequence has a lower melt 
temperature than the primer sequences. The amplification reagents, detection reagents and test 
sample are placed under amplification conditions whereby, in the presence of target sequence, 
5 copies of the target sequence (an amplicon) are produced. In the usual case, the amplicon is 
double stranded because primers are provided to amplify a target sequence and its 
complementary strand. The double stranded amplicon then is thermally denatured to produce 
single stranded amplicon members. Upon formation of the single stranded amplicon 
members, the mixture is cooled to allow the formation of complexes between the probes and 
1 0 single stranded amplicon members. 

As the single stranded amplicon sequences and probe sequences are cooled, the probe 
sequences preferentially bind the single stranded amplicon members. This finding is 
counterintuitive given that the probe sequences are generally selected to be shorter than the 
primer sequences and therefore have a lower melt temperature than the primers. Accordingly, 

1 5 the melt temperature of the amplicon produced by the primers should also have a higher melt 

temperature than the probes. Thus, as the mixture is cooled, the re-formation of the double 
stranded amplicon is expected although as previously stated, this is not the case. Instead, the 
probes are found to preferentially bind the single stranded amplicon members. Moreover, this 
preference of probe/single stranded amplicon binding exists even when the primer sequences 

2 0 are added in excess of the probes. 

After the probe/single stranded amplicon member hybrids are formed, they are 
detected. Standard heterogeneous assay formats are suitable for detecting the hybrids using 
the detection labels and capture labels present on the primers and probes. The hybrids can be 
bound to a solid phase reagent by virtue of the capture label and detected by virtue of the 

2 5 detection label. In cases where the detection label is directly detectable, the presence of the 

hybrids on the solid phase can be detected by causing the label to produce a detectable signal, 
if necessary, and detecting the signal. In cases where the label is not directly detectable, the 
captured hybrids can be contacted with a conjugate, which generally comprises a binding 
member attached to a directly detectable label. The conjugate becomes bound to the complexes 

3 0 and the conjugates presence on the complexes can be detected with the directly detectable label. 

Thus, the presence of the hybrids on the solid phase reagent can be determined. Those skilled 
in the art will recognize that wash steps may be employed to wash away unhybridized 
amplicon or probe as well as unbound conjugate. 

A test sample is typically anything suspected of containing a target sequence. Test 
3 5 samples can be prepared using methodologies well known in the art such as by obtaining a 

specimen from an individual and, if necessary, disrupting any cells contained therein to release 
target nucleic acids. Although the target sequence is described as single stranded, it also is 
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contemplated to include the ca.se where the target sequence is actually double stranded'but is 
merely separated from its complement prior to hybridization with the amplification primer 
sequences. In the case where PCR is employed in this method, the ends of the target 
sequences are usually known. In cases where LCR or a modification thereof is employed in 
5 the preferred method, the entire target sequence is usually known. Typically, the target 
sequence is a nucleic acid sequence such as, for example, RNA or DNA, 

The method provided herein can include well known amplification reactions that utilize 
thermal cycle reaction mixtures, particularly PCR and GLCR. Amplification reactions 
typically employ primers to repeatedly generate copies of a target nucleic acid sequence, which 
1 0 target sequence is usually a small region of a much larger nucleic acid sequence. Primers are 
themselves nucleic acid sequences that are complementary to regions of a target sequence. 
Under amplification conditions, these primers hybridize or bind to the complementary regions 
of the target sequence. Copies of the target sequence typically are generated by the process of 
primer extension and/or ligation which utilizes enzymes with polymerase or ligase activity, 

1 5 separately or in combination, to add nucleotides to the hybridized primers and/or ligate adjacent 

probe pairs. The nucleotides that are added to the primers or probes, as monomers or 
preformed oligomers, are also complementary to the target sequence. Once the primers or 
probes have been sufficiently extended and/or ligated they are separated from the target 
sequence, for example, by heating the reaction mixture to a "melt temperature" which is one in 

2 0 which complementary nucleic acid strands dissociate. Thus, a sequence complementary to the 

target sequence is formed. 

A new amplification cycle then can take place to further amplify the number of target 
sequences by separating any double stranded sequences, allowing primers or probes to 
hybridize to their respective targets, extending and/or ligating the hybridized primers or probes 

2 5 and re-separating. The complementary sequences that are generated by amplification cycles 

can serve as templates for primer extension or filling the gap of two probes to further amplify 
the number of target sequences. Typically, a reaction mixture is. cycled between 20 and 100 
times, more typically, a reaction mixture is cycled between 25 and 50 times. The numbers of 
cycles can be determined by the routineer. In this manner, multiple copies of the target 

3 0 sequence and its complementary sequence are produced. Thus, primers initiate amplification 

of the target sequence when it is present under amplification conditions. 

Generally, two primers which are complementary to a portion of a target strand and its 
complement are employed in PCR. For LCR, four probes, two of which are complementary 
to a target sequence and two of which are similarly complementary to the targets complement, 
3 5 are generally employed. In addition to the primer sets and enzymes previously mentioned, a 
nucleic acid amplification reaction mixture may also comprise other reagents which are well 
known and include but are not limited to: enzyme cofactors such as manganese; magnesium; 
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salts; nicotinamide adenine dinucleotide (NAD); and deoxynucleotide triphosphates (dNTPs) 
such as for example deoxyadeninc triphosphate, deoxyguanine triphosphate, deoxycytosine 
triphosphate and deoxythyminc triphosphate. 

While the amplification primers initiate amplification of the target sequence, in some 
5 cases, the detection (or hybridization) probe is not involved in amplification. Detection probes 
are generally nucleic acid sequences or uncharged nucleic acid analogs such as, for example, 
peptide nucleic acids which are disclosed in International Patent Application WO 92/20702; 
morpholino analogs which are described in U.S. Patents Nos 5,185,444, 5,034,506, and 
5,142,047; and the like. Depending upon the type of label carried by the probe, the probe is 
1 0 employed to capture or detect the amplicon generated by the amplification reaction. The probe 
is not involved in amplification of the target sequence and therefore may have to be rendered 
"non-extendable" in that additional dNTPs cannot be added to the probe. In and of themselves 
analogs usually are non-extendable and nucleic acid probes can be rendered non-extendable by 
modifying the 3' end of the probe such that the hydroxy] group is no longer capable of 

1 5 participating in elongation. For example, the 3' end of the probe cari be functionalized with the 

capture or detection label to thereby consume or otherwise block the hydroxyl group. 
Alternatively, the 3' hydroxyl group simply can be cleaved, replaced or modified. U.S. Patent 
Application Serial No. 07/049,061 filed April 19, 1993 and incorporated herein by reference 
describes modifications which can be used to render a probe non-extendable. 

2 0 Accordingly, the ratio of primers to probes is not important. Thus, either the probes or 

primers can be added to the reaction mixture in excess whereby the concentration of one would 
be greater than the concentration of the other. Alternatively, primers and probes can be 
employed in equivalent concentrations. Preferably, however, the primers are added to the 
reaction mixture in excess of the probes. Thus, primer to probe ratios of, for example, 5: 1 and 

2 5 20:1 are preferred. 

While the length of the primers and probes can vary, the probe sequences are selected 
such that they have a lower melt temperature than the primer sequences. Hence, the primer 
sequences are generally longer than the probe sequences. Typically, the primer sequences are 
in the range of between 20 and 50 nucleotides long, more typically in the range of between 20 

3 0 and 30 nucleotides long. The typical probe is in the range of between 10 and 25 nucleotides 

long. 

Alternatively, a probe may be involved in the amplifying a target sequence, via a 
process kne - ■ "nested PCR". In nested PCR, the probe has characteristics which are 
similar to those of the first and second primers normally used for amplification (such as length, 
3 5 melting temperature etc.) and as such, may itself serve as a primer in an amplification reaction. 
Generally in nested PCR, a first pair of primers (Pj and P2) are employed to form primary 
extension products. One of the primary primers (for example. Pi) may optionally be a capture 
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primer (i.e. linked to a member of a first reactive pair), whereas the other primary primer {P2) 
is not. A secondary extension product is then formed using the P] primer and a probe (P2O 
which may also have a capture type label (such as a member of a second reactive pair) or a 
detection label at its 5' end. The probe is complimentary to and hybridizes at a site on the 
5 template near or adjacent (but not overlapping) the site where the 3' terminus of P2 would 
hybridize if it was still in solution. Thus, a labeled primer/probe set generates a secondary 
product which is shorter than the primary extension product. Furthermore, the secondary 
product may be detected either on the basis of its size or via its labeled ends (by detection 
methodologies well known to those of ordinary skill in the art). In this process, probe and 
1 0 primers are generally employed in equivalent concentrations. . 

Various methods for synthesizing primers and probes are well known in the art. 
Similarly, methods for attaching labels to primers or probes are also well known in the art. 
For example, it is a matter of routine to synthesize desired nucleic acid primers or probes using 
conventional nucleotide phosphoramidite chemistry and instruments available from Applied 

1 5 Biosystems. Inc., (Foster City, CA), Dupont (Wilmington, DE), or Milligen (Bedford MA). 

Many methods have been described for labeling oligonucleotides such as the primers or probes 
of the present invention. Enzo Biochemical (New York, NY) and Clontech (Palo Alto, CA) 
both have described and commercialized probe labeling techniques. For example, a primary 
amine can be attached to a 3' oligo terminus using 3'-Amine-ON CPG™ (Clontech, Palo Alto, 

2 0 CA). Similarly, a primary amine can be attached to a 5' oligo terminus using Aminomodifier 

II® (Clontech). The amines can be reacted to various haptens using conventional activation 
and linking chemistries. In addition, copending applications US. Serial Nos. 625,566, filed 
December 1 1, 1990 and 630,908, filed December 20, 1990, which are each incorporated 
herein by reference, teach methods for labeling probes at their 5* and 3' termini, respectively. 
25 Publications WO92/10505, published 25 June 1992 and WO 92/1 1388 published 9 July 1992 
teach methods for labeling probes at their 5' and 3 1 ends, respectively. According to one 
known method for labeling an oligonucleotide, a label-phosphoramidite reagent is prepared 
and used to add the label to the oligonucleotide during its synthesis. See, for example, NT. 
Thuong et al., Tet. Letters 29(46):5905-5908 (1988); or J. S. Cohen et al., published U.S. 

3 0 Patent Application 07/246,688 (NTIS ORDER No. PAT-APPL-7-246,688) (1989). 

Preferably, probes are labeled at their 3' and 5' ends. 

Capture labels are carried by the primers or probes and can be a specific binding 
member which forms a binding pair with the "*!id phase reagent's specific binding member. It 
will be understood, of course that the primer probe itself may serve as the capture label. 
3 5 For example, in the case where a solid phase reagent's binding member is a nucleic acid 

sequence, it may be selected such that it binds a complementary portion of the primer or probe 
to thereby immobilize the primer or probe to the solid phase. In cases where the probe itself 
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serves as the binding member, those skilled. in the art will recognize that the probe will. contain 
a sequence or "tail" that is not complementary to the single stranded amplicon members. In the 
case where the primer itself serves as the capture label, at least a portion of the primer will be 
free to hybridize with a nucleic acid on a solid phase because the probe is selected such that it 
5 is not fully complementary to the primer sequence. 

Generally, probe/single stranded amplicon member complexes can be detected using 
techniques commonly employed to perform heterogeneous immunoassays. Preferably, in this 
embodiment, detection is performed according to the protocols used by the commercially 
available Abbott LCx® instrumentation (Abbott Laboratories, Abbott Park, IL). 
1 0 The primers and probes disclosed herein are useful in. typical PCR assays, wherein the 

test sample is contacted with a pair of primers, amplification is performed, the hybridization 
probe is added, and detection is performed. 

Another method provided by the present invention comprises contacting a lest sample 
with a plurality of polynucleotides wherein at least one polynucleotide is provided herein, 

1 5 hybridizing the test sample with the plurality of polynucleotides and detecting the hybridization 

complexes. The hybridization complexes are identified and quantitated to compile a profile 
which is indicative of cancer, metastases, arthritis or inflammatory conditions. Expressed 
RNA sequences may further be detected by reverse transcription and amplification of the DNA 
product by procedures well-known in the art, including polymerase chain reaction (PCR). 

2 0 Antisense and Gene Therapy . 

The present invention also encompasses the use of gene therapy methods for the 
introduction of anti-sense MMP19 gene derived molecules such as polynucleotides or 
oligonucleotides of the present invention into patients having conditions (such as cancer, 
arthritis or inflammation) that are associated with bnormal expression of MMP19 

2 5 polynucleotides. These molecules, including antisense RNA and DNA fragments and 

ribozymcs, are designed to inhibit the translation of a MMP19 polynucleotide mRNA, and may 
be used therapeutically in the treatment of conditions associated with altered or abnormal 
expression of the MMP19 polynucleotide. 

Alternatively, the oligonucleotides described above can be delivered to cells by 

3 0 procedures in the an such that the anti-sense RNA or DNA is subsequently expressed in vivo 

whereby it inhibits production of MMP19 polypeptide in the manner described above. 
Antisense constructs to MMP19 polynucleotide, therefore, reverse the action of MMP19 
transcripts and may be used for treating cancer, arthritis, inflammation and related disease 
states. These antisense constructs may also be used to treat tumor metastases. 
3 5 Antisense technology can be used to control gene expression through triple-helix 

formation or antisense DNA or RNA, both of which methods arc based on binding of a 
polynucleotide to DNA or RNA. For example, the 5' coding portion of the polynucleotide 
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sequence, which encodes for the polypeptide of the present invention, is used to design an 
antisense RNA oligonucleotide of from 10 to 40 base pairs in length. A DNA oligonucleotide 
is designed to be complementary to a region of the gene involved in transcription, thereby 
preventing transcription and the production of MMP19 polypeptide. For triple helix, see, for 
5 example, Lee ct al, Nucl. Acids Res . 6:3073 (1979): Cooney et al. Science 241:456 (1988); 
and Dervan et al. Science 251:1360 (1991) The antisense RNA oligonucleotide hybridizes to 
the mRNA in vivo and blocks translation of an mRNA molecule into the MMP19 polypeptide. 
For antisense, see, for example, Okano. J. Neurochem . 56:560 (1991); and 
"Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression", CRC Press, Boca 
1 0 Raton, Fla. ( 1 988). Antisense oligonucleotides act with greater efficacy when modified to 
contain artificial intern ucleotide linkages which render the molecule resistant to nucleolytic 
cleavage. Such artificial internucleotide linkages include but are not limited to 
methylphosphonate, phosphorothiolate and phosphoroamydate internucleotide linkages. 
The present invention also encompasses gene therapy whereby 1 the gene encoding 

1 5 MMP19 is regulated in a patient. Various methods of transferring or delivering DNA to cells 

for expression of the gene product protein, otherwise referred to as gene therapy, are disclosed 
in "Gene Transfer into Mammalian Somatic Cells in vivo", N. Yang, Crit. Rev. Biotechn. 
12(4): 335-356 (1992), which is hereby incorporated by reference. Gene therapy 
encompasses incorporation of DNA sequences into somatic cells or germ line cells for use in 

2 0 either ex vivo or in vivo therapy. Gene therapy functions to replace genes, augment normal or 

abnormal gene function, and to combat infectious diseases and other pathologies. 

Strategies for treating medical problems with gene therapy include therapeutic strategies 
such as identifying the defective gene and then adding a functional gene to either replace the 
function of the defective gene or to augment a slightly functional gene; or prophylactic 

2 5 strategies, such as adding a gene which encodes a protein product that will treat the condition 

or that will make the tissue or organ more susceptible to a treatment regimen. 

Many protocols for transfer of MMPI9 DNA or MMP19 regulatory sequences arc 
envisioned in this invention. Transfection of promoter sequences, other than one specifically 
associated with MMP19, or other sequences which would increase production of MMP19 

3 0 protein are also envisioned as methods of gene therapy. An example of this technology is 

found in Transkaryotic Therapies, inc., of Cambridge, Massachusetts, using homologous 
recombination to insert a "genetic switch" that turns on an erythropoietin gene in cells. See 
Genetic Engineering News . April 15, 1994. Such "genetic switches" could be used to activate 
MMP19 (or a MMP19 receptor) in cells not normally expressing these proteins. 
3 5 Gene transfer methods for gene therapy fall into three broad categories: ( 1 ) physical 

(e.g., electroporation, direct gene transfer and particle bombardment), (2) chemical (e.g. lipid- 
based carriers and other non-viral vectors) and (3) biological (e.g. virus derived vectors). For 
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example, non-viral vectors such as liposomes coated with DNA may be directly injected 
intravenously into the patient. It is believed that the liposome/DNA complexes are 
concentrated in the liver where they deliver the DNA to macrophages and Kupffer cells. 
Additionally, vectors or the "naked" DNA of the gene may be directly injected into the desired 
5 organ, tissue or tumor for targeted delivery of the therapeutic DNA. 

Gene therapy methodologies can also be described by delivery site. Fundamental ways 
to deliver genes include ex vivo gene transfer, in vivo gene transfer, and in vitro gene transfer. 
In ex vivo gene transfer, cells are taken from the patient and grown in cell culture. The DNA 
is transfected into the cells, the transfected cells are expanded in number and then reimplanted 
1 0 in the patient. In in vitro gene transfer, the transformed cells are cells growing in culture, such 
as tissue culture ceils, and not particular cells from a particular patient. These "laboratory 
cells" are transfected, the transfected cells are selected and expanded for either implantation 
into a patient or for other uses. In vivo gene transfer involves introducing the DNA into the 
cells of the patient when the cells are within the patient. AH three of the broad based categories 

1 5 described above may be used to achieve gene transfer in vivo, ex vivo, and in vitro. 

Mechanical (i.e. physical) methods of DNA delivery can be achieved by direct injection 
of DNA, such as microinjection of DNA into germ or somatic cells, pneumatically delivered 
DNA-coated particles, such as the gold particles used in a "gene gun," and inorganic chemical 
approaches such as calcium phosphate transfection. It has been found that physical injection 

2 0 of plasmid DNA into muscle cells yields a high percentage of cells which arc transfected and 

have a sustained expression of marker genes. The plasmid DNA may or may not integrate into 
the genome of the cells. Non-integration of the transfected DNA would allow the transfection 
and expression of gene product proteins in terminally differentiated, non-proliferativc tissues 
for a prolonged period of time without fear of mutational insertions, deletions, or alterations in ~ 

2 5 the cellular or mitochondria] genome. Long-term, but not necessarily permanent, transfer of 

therapeutic genes into specific cells may provide treatments for genetic diseases or for 
prophylactic use. The DNA could be reinjected periodically to maintain the gene product level 
without mutations occurring in the genomes of the recipient cells. Non-integration of 
exogenous DNAs may allow for the presence of several different exogenous DNA constructs 

3 0 within one cell with all of the constructs expressing various gene products. 

Particle-mediated gene transfer may also be employed for injecting DNA into cells, 
tissues and organs. With a particle bombardment device, or "gene gun," a motive force is 
generated to accelerate DNA-coated high density particles (such ^ gold or tungsten) to a high 
velocity that allows penetration of the target organs, tissues or ceiis. Electroporation ior gene 
3 5 transfer uses an electrical current to make cells or tissues susceptible to electroporation- 

mediated gene transfer. A brief electric impulse with a given Held strength is used to increase 
the permeability of a membrane in such a way that DNA molecules can penetrate into the cells. 



WO 98/40475 



PCT/US98/04694 



The techniques of particle-mediated gene transfer and electroporation are well known to those 
of ordinary skill in the art. 

Chemical methods of gene therapy involve carrier mediated gene transfer through the 
use of fusogenic lipid vesicles such as liposomes or other vesicles for membrane fusion. A 
5 carrier harboring a DNA of interest can be conveniently introduced into body fluids or the 
bloodstream and then site specifically directed to the target organ or tissue in the body. 
Liposomes, for example, can be developed which are cell specific or organ specific. The 
foreign DNA carried by the liposome thus will be taken up by those specific cells. Injection of 
immunoliposomes that are targeted to a specific receptor on certain cells can be used as a 
1 0 convenient method of inserting the DNA into the cells bearing the receptor. Another carrier 
. system that has been used is the asialoglycoprotein/polylysine conjugate system for carrying 
DNA to hepatocytes form vi vo gene transfer. 

Transfected DNA may also be complexed with other kinds of carriers so that the DNA 
is carried to the recipient cell and then resides in the cytoplasm or in the nucleoplasm of the 

1 5 recipient cell. DNA can be coupled to carrier nuclear proteins in specifically engineered vesicle 

complexes and carried directly into the nucleus. 

Carrier mediated gene transfer may also involve the use of lipid-based proteins which 
are not liposomes. For example, lipofectins and cytofectins are lipid-based positive ions that 
bind to negatively charged DNA, forming a complex that can ferry the DNA across a cell 

2 0 membrane. Another method of carrier mediated gene transfer involves receptor-based 

endocyto.sis. In this method, a ligand (specific to a cell surface receptor) is made to form a 
complex with a gene of interest and then injected into the bloodstream; target cells that have the 
cell surface receptor will specifically bind the ligand and transport the ligand-DNA complex 
into the cell. 

2 5 Biological gene therapy methodologies usually employ viral vectors to insert genes into 

cells. The term "vector" as used herein in the context of biological gene therapy means a 
carrier that can contain or associate with specific polynucleotide sequences and which 
functions to transport the specific polynucleotide sequences into a cell. The transfected cells 
may be cells derived from the patient's normal tissue, the patient's diseased tissue, or may be 

3 0 non-patient cells. Examples of vectors include plasmids and infective microorganisms such as 

viruses, or non-viral vectors such as the ligand-DNA conjugates, liposomes, and lipid-DNA 
complexes discussed above. 

It may be desirable that a recombinant DNA molecule comprising a MMP19 DNA 
sequence is operatively linked to an expression control sequence to form an expression vector 
3 5 capable of expressing MMP19. Alternatively, gene regulation of MMP19 may be 

accomplished by administering proteins that bind to control regions associated with the 
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MMP19 gene, or its corresponding RNA transcript to modify the rate of transcription or 
translation. 

Viral vectors that have been used for gene therapy protocols include but are not limited 
to, retroviruses, other RNA viruses such as poliovirus or Sindbis virus, adenovirus, adeno- 
5 associated virus, herpes viruses, SV 40, vaccinia and other DNA viruses. Replication- 
defective murine retroviral vectors are the most widely utilized gene transfer vectors. Murine 
leukemia retroviruses are composed of a single strand RNA completed with a nuclear core 
protein and polymerase (pol) enzymes, encased by a protein core (gag) and surrounded by a 
glycoprotein envelope (env) that determines host range. The genomic structure of retroviruses 
1 0 include the gag, pol, and env genes enclosed at by 5' and 3' long terminal repeats (LTR). 

Retroviral vector systems exploit the fact that a minimal, vector containing the 5* and 3' LTRs 
and the packaging signal are sufficient to allow vector packaging, infection and integration into 
target cells providing that the viral structural proteins are supplied in trans in the packaging cell 
line. Fundamental advantages of retroviral vectors for gene transfer include efficient infection 

1 5 and gene expression in most cell types, precise single copy vector integration into target cell 

chromosomal DNA, and ease of manipulation of the retroviral genome. For example, altered 
retrovirus vectors have been used in ex vivo methods to introduce genes into peripheral and 
tumor-infiltrating lymphocytes, hepatocytes, epidermal cells, myocytes, or other somatic cells 
(which may then introduced into the patient to provide the gene product from the inserted 

2 0 DNA). 

The adenovirus is composed of linear, double stranded DNA complexed with core 
proteins and surrounded with capsid proteins. Advances in molecular virology have fed to the 
ability to exploit the biology of these organisms to create vectors capable of transducing novel 
genetic sequences into target cells in vivo . Adenoviral-based vectors will express gene 

2 5 product peptides at high levels. Adenoviral vectors have high efficiencies of infectivity, even 

with low titers of virus. Additionally, the virus is fully infective as a cell free virion so 
injection of producer cell lines are not necessary. Another potential advantage to adenoviral 
vectors is the ability to achieve long term expression of heterologous genes in vivo. 

Viral vectors have also been used to insert genes into cells using in vivo protocols. To 

3 0 direct tissue-specific expression of foreign genes, cis-acting regulatory elements or promoters 

that are known to be tissue specific can be used. Alternatively, this can be achieved using in 
.situ delivery of DNA or viral vectors to specific anatomical sites in vivo. For example, gene 
transfer to blood vessels in vivo was achieved by implanting in vitro transduced endothelial 
cells in chosen sites on arterial walls. The virus infected rounding cells which also 
3 5 expressed the gene product. A viral vector can be delivered directly to the in vivo site, by a 
catheter for example, thus allowing only certain areas to be infected by the virus, and 
providing long-term, site specific gene expression, in vivo gene transfer using retrovirus 
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vectors has also been demonstrated in mammary tissue and hepatic tissue by injection of the 
altered virus into blood vessels leading to the organs. 

The present invention also provides a method of screening a plurality of compounds 
5 for specific binding to a MMP 19 polypeptide, or any fragment thereof, to identify at least one 
compound which specifically binds the MMP 19 polypeptide. Such a method comprises the 
steps of providing at least one compound; combining the MMP 19 polypeptide with each 
compound under suitable conditions for a time sufficient to allow binding; and detecting 
MMP 19 polypeptide binding to each compound. 
1 0 The present invention also provides a method of screening a plurality of compounds 

for inhibition of the activity of a MMP 19 polypeptide, or any fragment.thereof, to identify at 
least one compound which specifically inhibits the activity of the MMP 19 polypeptide. Such a 
method comprises the steps of providing at least one compound; combining the MMP 19 
polypeptide with each compound under suitable conditions for a time sufficient to allow 

1 5 interaction or binding; and detecting MMP 19 polypeptide inhibition by each compound. 

The polypeptide or peptide fragment employed in such a test may either be free in 
solution, affixed to a solid support, borne on a cell surface or located intracellular^. One 
method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably 
transformed with recombinant nucleic acids which can express the polypeptide or peptide 

2 0 fragment. Drugs may be screened against such transformed cells in competitive binding or 

enzymatic inhibition assays. For example, the formation of complexes between a polypeptide 
and the agent being tested can be measured in either viable or fixed cells. 

The present invention thus provides methods of screening for drugs or any other agent 
which can be used to treat conditions associated with cancer, arthritis, or inflammation 

2 5 resulting from abnormal MMP 19 production. These methods comprise contacting the drug 

with a polypeptide or fragment thereof and assaying for either the presence of a complex 
between the agent and the polypeptide, or for the presence of a complex between the 
polypeptide and the cell. In competitive binding assays, the polypeptide typically is labeled. 
After suitable incubation, free (or uncomplexed) polypeptide or fragment thereof is separated 

3 0 from that present in bound form, and the amount of free or uncomplexed label is a measure of 

the ability of the particular drug to bind to polypeptide or to interfere with the polypeptide/cell 
complex. 

The present invention also encompasses the use of competitive drug screening assays 
in which neutralizing antibodies capable of binding polypeptide specifically compete with a test 
3 5 drug for binding to the polypeptide or fragment thereof. In this manner, the antibodies can be 
used to detect the presence of any polypeptide in the test sample which shares one or more 
antigenic determinants with a polypeptide provided herein. 
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Another technique for drug screening provides high throughput screening for 
compounds having suitable binding affinity to at least one polypeptide disclosed herein. 
Briefly, large numbers of different small peptide test compounds are synthesized on a solid 
phase, such as plastic pins or some other surface. The peptide test compounds are reacted 
5 with polypeptide and washed. Polypeptide thus bound to the solid phase is detected by 

methods well-known in the art. Purified polypeptide can also be coated directly onto plates for 
use in the drug screening techniques described herein. In addition, non-neutralizing antibodies 
can be used to capture the polypeptide and immobilize it on the solid support. See, for 
example, EP 84/03564, published on September 13, 1984, which is incorporated herein by 
1 0 reference. 

Drug Design 

The goal of rational drug design is to produce structural analogs of biologically active 
polypeptides of interest or of the small molecules including agonists, antagonists, or inhibitors 
with which they interact. Such structural analogs can be used to fashion drugs which are more 

1 5 active or stable forms of the polypeptide or which enhance or interfere with the function of a 

polypeptide in vivo. J. Hodgson, Bio/Technologv 9:19-21 ( 1 99 1 ), incorporated herein by 
reference. 

For example, in one approach, the three-dimensional structure of a polypeptide, or of a 
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling 

2 0 or, most typically, by a combination of the two approaches. Both the shape and charges of the 

polypeptide must be ascertained to elucidate the structure and to determine active site(s) of the 
molecule. Less often, useful information regarding the structure of a polypeptide may be 
gained by modeling based on the structure of homologous proteins. In both cases, relevant 
structural information is used to design analogous polypeptide-like molecules or to identify 

2 5 efficient inhibitors 

Useful examples of rational drug design may include molecules which have improved 
activity or stability as shown by S. Braxton et al., Biochemistry 31 :7796-7801 (1992), or 
which act as inhibitors, agonists, or antagonists of native peptides as shown by S. B. P. 
Athauda et ai., J Biochcm. (Tokyo) 113 (6):742-746 (1993), incorporated herein by reference. 

3 0 It also is possible to isolate a target -specific antibody, selected by an assay as described 

hereinabove, and then to determine its crystal structure. In principle this approach yields a 
pharmacophore upon which subsequent drug design can be based. It further is possible to 
bypass protein crystallography altogether by generating anti-idiotypic antibodies ("anti-ids") to 
a functional, pharmacologically active antibody. As a mirror image of a mirror image, the 
3 5 binding site of the anti-id is an analog of the original receptor. The anti-id then could be used 
to identify and isolate peptides from banks of chemically or biologically produced peptides. 
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The isolated peptides then can act as the pharmacophore (that is, a prototype pharmaceutical 
drug). 

A sufficient amount of a recombinant polypeptide of the present invention may be made 
available to perform analytical studies such as X-ray crystallography. In addition, knowledge 
5 of the polypeptide amino acid sequence which are derivable from the nucleic acid sequence 
provided herein will provide guidance to those employing computer modeling techniques in 
place of or in addition to x-ray crystallography. 

Antibodies specific to the MMP19 polypepeptide may further be used to inhibit the 
biological action of the polypeptide by binding to the polypeptide. In this manner, the 
1 0 antibodies may be used in therapy, for example, to treat cancer and its metastases. 

Further, such antibodies can detect the presence or absence of the MMP 19 polypeptide 
and, therefore, are useful as diagnostic markers for the diagnosis of cancers, arthritis, and 
inflammatory conditions. Such antibodies may also function as a diagnostic marker for these 
conditions. The present invention also is directed to antagonists and inhibitors of the 

1 5 polypeptides of the present invention. The antagonists and inhibitors are those which inhibit 

or eliminate the function of the polypeptide. Thus, for example, an antagonist may bind to a 
polypeptide of the present invention and inhibit or eliminate its function. The antagonist, for 
example, could be an antibody against the polypeptide which eliminates the activity of the 
MMP 19 polypeptide by binding to the MMP19 polypeptide, or in some cases the antagonist 

2 0 may be an oligonucleotide. Examples of small molecule inhibitors include but are not limited 

to small peptides or peptide-like molecules. 

The antagonists and inhibitors may be employed as a composition with a 
pharmaceutical ly acceptable carrier, including but not limited to saline, buffered saline, 
dextrose, water, glycerol, ethanol and combinations thereof. Administration of MMP 19 

2 5 polypeptide inhibitors are preferably systemic. The present invention also provides an 

antibody which inhibits the action of such polypeptide. 
Recombinant Technology . 

The present invention provides host cells and expression vectors comprising 
polynucleotides of the present invention and methods for the production of polypeptides they 

3 0 encode. Such methods comprise culturing the host cells under conditions suitable for the 

expression of the MMP1 9 polynucleotide and recovering the MMP1 9 polypeptide from the cell 
culture. ^ 

The present invention also provides vectors which include polynucleotides of the 
present invention, host cells which are genetically engineered with vectors of the present 
3 5 invention and the production of polypeptides of the present invention by recombinant 
techniques. 
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Host cells are genetically engineered (transduced or transformed or transfected) with 
the vectors of this invention which may be a cloning vector or an expression vector. The 
vector may be in the form of a plasmid, a viral panicle, a phage, etc. The engineered host cells 
can be cultured in conventional nutrient media modified as- appropriate for activating 
5 promoters, selecting transformants or amplifying the MMPI9 genes. The culture conditions, 
such as temperature, pH and the like, arc those previously used with the host cell selected for 
expression, and will be apparent to the ordinarily skilled artisan. 

The polynucleotide of the present invention may be employed for producing a 
polypeptide by recombinant techniques. Thus, the polynucleotide sequence may be included 
1 0 in any one of a variety of expression vehicles, in particular vectors or plasmids for expressing 
a polypeptide. Such vectors include chromosomal, nonchromosomal and synthetic DNA 
sequences, e.g.. derivatives of SV40; bacterial plasmids; phage DNA; yeast plasmids; vectors 
derived from combinations of plasmids and phage DNA, viral DNA such as vaccinia, 
adenovirus, fowl pox virus, and pseudorabies. However, any other plasmid or vector may be 

1 5 used so long as it is replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a variety of 
procedures. In general, the DNA sequence is inserted into appropriate restriction endonuclease 
sites by procedures known in the art. Such procedures and others are deemed to be within the 
scope of those skilled in the art. The DNA sequence in the expression vector is operatively 

2 0 linked to an appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. 

Representative examples of such promoters include but are not limited to LTR or SV40 
promoter, the E. coli lac or trp, the phage lambda P sub L promoter and other promoters 
known to control expression of genes in prokaryotic or cukaryotic cells or their viruses. The 
expression vector also contains a ribosome binding site for translation initiation and a 

2 5 transcription terminator. The vector may also include appropriate sequences for amplifying 

expression. In addition, the expression vectors preferably contain a gene to provide a 
phenotypic trait for selection of transformed host cells such as dihydrofolate reductase or 
neomycin resistance for eukaryotic cell culture, or such as tetracycline or ampicillin resistance 
in E. coli . 

3 0 The vector containing the appropriate DNA sequence as hereinabove described, as well 

as an appropriate promoter or control sequence, may be employed to transform an appropriate 
host to permit the host to express the protein. As representative examples of appropriate hosts, 
there m:- - be mentioned: bacterial cells, such as E. coli . Salmonella tvnhimurium; 
Sireptoinvcessp. : fungal ceiis, such as yeast; insect cells such as Drosophila and Sf9; animal 
3 5 cells such as CHO, COS or Bowes melanoma; plant cells, etc. The selection of an appropriate 
host is deemed to be within the scope of those skilled 4n the art from the teachings provided 
herein. 
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More particularly, the present invention also includes recombinant constructs 
comprising one or more of the sequences as broadly described above. The constructs 
comprise a vector, such as a plasmid or viral vector, into which a sequence of the invention 
has been inserted, in a forward or reverse orientation. In a preferred aspect of this 
5 embodiment, the construct further comprises regulatory sequences, including, for example, a 
promoter, operably linked to the sequence. Large numbers of suitable vectors and promoters 
are known to those of skill in the art, and are commercially available. The following vectors 
are provided by way of example. Bacterial: pSPORTl (Life Technologies, Gaithersburg, 
MD), pQE70, pQE60, pQE-9 (Qiagen) pBs, phagescript, psiX174, pBluescript SK, pBsKS, 
1 0 pNH8a, pNH!6a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, 
pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLneo, pSV2cat, pOG44, pXTl , pSG 
(Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia). However, any other plasmid or 
vector may be used as long as it is replicable and viable in the host. 

Promoter regions can be selected from any desired gene using CAT (chloramphenicol 

1 5 transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 

pKK232-8 and pCM7. Particular named bacterial promoters include lad, lacZ, T3, SP6, T7, 
gpt, lambda P sub R, P sub L and trp. Eukaryotic promoters include cytomegalovirus (CMV) 
immediate early, herpes simplex virus (HSV) thymidine kinase, early and late SV40, LTRs 
from retrovirus, and mouse metallothionein-I. Selection of the appropriate vector and 

2 0 promoter is well within the level of ordinary skill in the art. 

In a further embodiment, the present invention provides host cells containing the 
above-described construct. The host cell can be a higher eukaryotic cell, such as a mammalian 
cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can be a prokaryotic cell, 
such as a bacterial cell. Introduction of the construct into the host ceil can be effected by 

2 5 calcium phosphate transfection, DEAE-Dextran mediated transfection, or electroporation (L. 

Davis et al., "Basic Methods in Molecular Biology", 2nd edition, Appleton and Lang, 
Paramount Publishing, East Norwalk, CT ( 1 994). 

' The constructs in host cells can be used in a conventional manner to produce the gene 
product encoded by the recombinant sequence. Alternatively, the polypeptides of the invention 

3 0 can be synthetically produced by conventional peptide synthesizers. 

Proteins can be expressed in mammalian cells, yeast^ bacteria, or other cells under the 
control of appropriate promoters. Cell-free translation systems can also be employed to 
produce such proteins using RNAs derived from the DNA constructs of the present invention. 
Appropriate cloning and expression vectors for use with prokaryutic and eukaryotic hosts are 
3 5 described by Sambrook et al., Molecular Cloning: A Lab oratory Manual. Second Edition, 
(Cold Spring Harbor, N.Y., 1989), which is hereby incorporated by reference. 
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Transcription of a DNA encoding the polypeptides of the present invention by higher 
eukaryotes is increased by inserting an enhancer sequence into the vector. Enhancers are cis- 
acting elements of DNA, usually about from 10 to 300 bp, that act on a promoter to increase its 
transcription. Examples include the SV40 enhancer on the late side of the replication origin 
5 (bp 100 to 270), a cytomegalovirus early promoter enhancer, a polyoma enhancer on the late 
side of the replication origin, and adenovirus enhancers. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance 
gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed 
1 0 gene to direct transcription of a downstream structural sequence. Such promoters can be 

derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), 
alpha factor, acid phosphatase, or heat shock proteins, among others. The heterologous 
structural sequence is assembled in appropriate phase with translation initiation and termination 
sequences, and preferably, a leader sequence capable of directing secretion of translated 

1 5 protein into the periplasmic space or extracellular medium.^ Optionally, the heterologous 

sequence can encode a fusion protein including an N-terminal identification peptide imparting 
desired characteristics, e.g., stabilization or simplified purification of expressed recombinant 
product. 

Useful expression vectors for bacterial use are constructed by inserting a structural 

2 0 DNA sequence encoding a desired protein together with suitable translation initiation and 

termination signals in operable reading phase with a functional promoter. The vector will 
comprise one or more phenotypic selectable markers and an origin of replication to ensure 
maintenance of the vector and to, if desirable, provide amplification within the host. Suitable 
prokaryotic hosts for transformation include E. coli . Bacillus subtilis . Salmonella tvphimurium 

2 5 and various species within the genera Pseudomonas . Strcptomyccs . and Staphylococcus . 

although others may also be employed as a routine matter of choice. . 

Useful expression vectors for bacterial use comprise a selectable marker and bacterial 
origin of replication derived from plasmids comprising genetic elements of the well-known 
cloning vector pBR322 (ATCC 37017). Other vectors include but are not limited to PKK223- 

3 0 3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM! (Promega Biotec, Madison, Wl). 

These pBR322 "backbone" sections are combined with an appropriate promoter and the 
structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host strain to an 
appropriate cell density, the selected promoter is derepressed by appropriate means (e.g., 
3 5 temperature shift or chemical induction), and cells are cultured for an additional period. Cells 
are typically harvested by centrifugation, disrupted by physical or chemical means, and the 
resulting crude extract retained for further purification. Microbial cells employed in expression 
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of proteins can be disrupted by any convenient method, including freeze-thaw cycling, 
sonication, mechanical disruption, or use of cell lysing agents; such methods are well-known 
to the ordinary artisan. 

Various mammalian cell culture systems can also be employed to express recombinant 
protein. Examples of mammalian expression systems include the COS-7 lines of monkey 
kidney fibroblasts described by Gluzman, £ejl 23: 1 75 ( 1 98 1 ), and other cell lines capable of 
expressing a compatible vector, such as the CI 27, 3T3, CHO, HeLa and BHK cell lines. 
Mammalian expression vectors will comprise an origin of replication, a suitable promoter and 
enhancer, and also any necessary ribosome binding sites, polyadenylation site, splice donor 
and acceptor sites, transcriptional termination sequences, and 5' flanking nontranscribed 
sequences. DNA sequences derived from the SV40 viral genome, for example, S V40 origin, 
early promoter, enhancer, splice, and polyadenylation sites may be used to provide the 
required nontranscribed genetic elements. Representative, useful vectors include pRc/CMV 
and pcDNA3 (available from Invitrogen, San Diego, CA). 

MMP19 polypeptide is recovered and purified from recombinant cell cultures by 
known methods including ammonium sulfate or ethanol precipitation, acid extraction, anion or 
cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction 
chromatography, hydroxyapatite chromatography or lectin chromatography. It is preferred to 
have low concentrations (approximately 0.1-5 mM) of calcium ion present during purification 
(Price, et ah, J. Biol. Chem . 244:9 1 7 [ 1 969]). Protein refolding steps can be used, as 
necessary, in completing configuration of the protein. Finally, high performance liquid 
chromatography (HPLC) can be employed for final purification steps. 

The polypeptides of the present invention may be naturally purified products expressed 
from a high expressing cell line, or a product of chemical synthetic procedures, or produced by 
recombinant techniques from a prokaryotic or eukaryotic host (for example, by bacterial, 
yeast, higher plant, insect and mammalian cells in culture). Depending upon the host 
employed in a recombinant production procedure, the polypeptides of the present invention 
may be glycosylated with mammalian or other eukaryotic carbohydrates or may be non- 
glycosylated. The polypeptides of the invention may also include an initial methionine amino 
acid residue. 

The starting plasmids can be constructed from available plasmids in accord with 
published, known procedures. In addition, equivalent plasmids to those described are known 
in the art and will be apparent to the ordinarily skilled artisan. 

1 he following is the general procedure for the isolation and analysis of cDNA clones. 
In a particular embodiment disclosed herein, an EST homologous to a portion of human 
MMPs was identified by comparison with a consensus sequence to human MMPs. 



WO 98/40475 



PCT/US98/04694 



36 

The cDNA insert was sequenced in entirety, analyzed in detail set forth in the Examples 
and is disclosed in the Sequence Listing as SEQ NO: 4. These polynucleotides may contain an 
entire open reading frame with or without associated regulatory sequences for a particular 
gene, or they may encode only a portion of the gene of interest. This is attributed to the fact 
5 that many genes are several hundred, and sometimes several thousand, bases in length and, 
with current technology, cannot be cloned in their entirety because of vector limitations, 
incomplete reverse transcription of the first strand, or incomplete replication of the second 
strand. Contiguous, secondary clones containing additional nucleotide sequence may be 
obtained using a variety of methods known to those of skill in the art. 
1 0 Methods for DNA sequencing are well known in the art. Conventional enzymatic 

methods employ DNA polymerase, Klenow fragment, Sequenase (US Biochemical Corp, 
Cleveland. OH) or Taq polymerase to extend DNA chains from an oligonucleotide primer 
annealed to the DNA template of interest. Methods have been developed for the use of both 
single-stranded and double-stranded templates. The chain termination reaction products may 

1 5 be electrophorescd on urea/polyacrylamide gels and detected either by autoradiography (for 

radionucleotide labeled precursors) or by fluorescence (for fluorescent-labeled precursors). 
Recent improvements in mechanized reaction preparation, sequencing and analysis using the 
fluorescent detection method have permitted expansion in the number of sequences that can be 
determined per day using machines such as the Applied Biosystems 377 DNA Sequencers 
Applied Biosystems, Foster City, CA). 

The reading frame of the nucleotide sequence can be ascertained by several types of 
lyses. First, reading frames contained within the coding sequence can be analyzed for the 
■nee of start codon ATG and stop codons TGA, TAA or TAG. Typically, one reading 
will continue throughout the major portion of a cDNA sequence while the other two 

2 5 ■ frames tend to contain numerous stop codons. In such cases reading frame 

o nation is straightforward. In other more difficult cases, further analysis is required. 

igorithms have been created to analyze the occurrence of individual nucleotide bases 
at eac. ative codon triplet. See, for example J. W. Fickett, Nuc Acids Res 10:5303 
(1982). iing DNA for particular organisms (bacteria, plants, and animals) tends to contain 

3 0 certain nucleotides within certain triplet periodicities, such as a significant preference for 

pyrimidines in the third codon position. These preferences have been incorporated into widely 
available software which can be used to determine coding potential (and frame) of a given 
stretch of DNA. The algorithm-derived information combined with start/stop codon 
information can be used to determine proper frame with a high degree of certainty. This, in 
3 5 turn, readily permits cloning of the sequence in the correct reading frame into appropriate 
expression vectors. 
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The nucleic acid sequences disclosed herein may be joined to a variety of other 
polynucleotide sequences and vectors of interest by means of well established recombinant 
DNA techniques. See J. Sambrook et al., supra. Vectors of interest include cloning vectors, 
such as piasmids, cosmids, phage derivatives, phagemids, as well as sequencing, replication, 
5 and expression vectors, and the like. In general, such vectors contain an origin of replication 
functional in at least one organism, convenient restriction endonuclease digestion sites, and 
selectable markers appropriate for particular host cells. The vectors can be transferred by a 
variety of means known to those of skill in the art into suitable host cells which then produce 
the desired DNA, RNA or polypeptides. 
1 0 Occasionally, sequencing or random reverse transcription errors. will mask the presence 

of the appropriate open reading frame or regulatory element. In such cases, it is possible to 
determine the correct reading frame by attempting to express the polypeptide and determining 
the amino acid sequence by standard peptide mapping and sequencing techniques. See, F. M. 
Ausubel, et al., Current Protocols in Molecular Biology . John Wiley & Sons, New York, NY 

1 5 (1989). Additionally, the actual reading frame of a given nucleotide sequence may be 

determined by transfection of host cells with vectors containing all three potential reading 
frames. Only those cells with the nucleotide sequence in the correct reading frame will 
produce a peptide of the predicted length. 

The nucleotide sequences provided herein have been prepared by current, state-of-the- 

2 0 art, automated methods and as such may contain unidentified nucleotides. These will not 

present a problem to those skilled in the art who wish to practice the invention. Several 
methods employing standard recombinant techniques, described in J. Sambrook f supra ) or 
periodic updates thereof, may be used to complete the missing sequence information. The 
same techniques used for obtaining a full length sequence, as described herein, may be used to 

2 5 obtain nucleotide sequence. 

Expression of a particular cDNA may be accomplished by subcloning the cDNA into 
an appropriate expression vector and transfecting this vector into an appropriate expression 
host. The cloning vector used for the generation of the cDNA library can be used for 
transcribing mRNA of a particular cDNA and contains a promoter for beta-galactosidase, an 

3 0 amino-terminal met and the subsequent seven amino acid residues of beta-galactosidase. 

Immediately following these eight residues is an engineered bacteriophage promoter useful for 
artificial priming and transcription and a number of unique restriction sites, including EcoR I, 
for cloning. The vector can be transfected into an appropriate host strain of E. coli. 

Induction of the isolated bacterial strain with isopropylthiogalactosidc (IPTG) using 
3 5 standard methods will produce a fusion protein which contains the first seven residues of beta- 
galactosidase, about 15 residues of linker, and the peptide encoded within the cDNA. Since 
cDNA clone inserts are generated by an essentially random process, there is one chance in 
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three that the included cDNA will lie in the correct frame for proper translation. If thecDNA is 
not in the proper reading frame, the correct frame can be obtained by deletion or insertion of an 
appropriate number of bases by well known methods including in vitro mutagenesis, digestion 
with exonuclease III or mung bean nuclease, or oligonucleotide linker inclusion. 
5 The cDNA can be shuttled into other vectors known to be useful for expression of 

protein in specific hosts. Oligonucleotide primers containing cloning sites and segments of 
DNA sufficient to hybridize to stretches at both ends of the target cDNA can be synthesized 
chemically by standard methods. These primers can then be used to amplify the desired gene 
segments by PCR. The resulting new gene segments can be digested with appropriate 
1 0 restriction enzymes under standard conditions and isolated by: gel electrophoresis. Alternately, 
similar gene segments can be produced by digestion of the cDNA with appropriate restriction 
enzymes and filling in the missing gene segments with chemically synthesized 
oligonucleotides. Segments of the coding sequence from more than one gene can be ligated 
together and cloned in appropriate vectors to optimize expression of recombinant sequence. 

1 5 Suitable expression hosts for such chimeric molecules include but are not limited to, 

mammalian cells such as Chinese Hamster Ovary (CHO) and human 293 cells, insect cells 
such as Sf9 cells, yeast cells such as Saccharomvces cerevisiae . and bacteria such as E. coli . 
For each of these cell systems, a useful expression vector may also include an origin of 
replication to allow propagation in bacteria and a selectable marker such as the beta-lactamase 

2 0 antibiotic resistance gene to allow selection in bacteria. In addition, the vectors may include a 

second selectable marker such as the neomycin phosphotransferase gene to allow selection in 
transfected eukaryotic host ceils. Vectors for use in eukaryotic expression hosts may require 
the addition of 3' poly A tail if the sequence of interest lacks poly A. 

Additionally, the vector may contain promoters or enhancers which increase gene 

2 5 expression. Such promoters are host specific and include but are not limited to MMTV, 

SV40, or metal lothionine promoters for CHO cells; trp, lac, tac or T7 promoters for bacterial 
hosts; or alpha factor, alcohol oxidase or PGH promoters for yeast. Adenoviral vectors with 
or without transcription enhancers, such as the rous sarcoma virus (RSV) enhancer, may be 
used to drive protein expression in mammalian cell lines. Once homogeneous cultures of 

3 0 recombinant cells are obtained, large quantities of recombinantly produced protein can be 

recovered from the conditioned medium and analyzed using chromatographic methods well 
known in the art. An alternative method for the production of large amounts of secreted 
protein involves the transformation of mammalian embryos and the recovery of the 
recombinant protein from milk produced by transgenic cows, goats, sheep, etc. Polypeptides 
3 5 and closely related molecules may be expressed recombinantly in such a way as to facilitate 
protein purification. One approach involves expression of a chimeric protein which includes 
one or more additional polypeptide domains not naturally present on human polypeptides. 
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Such purification-facilitating domains include, but are not limited to, metal-chelating peptides 
such as histidine-tryptophan domains that allow purification on immobilized metals, protein A 
domains that allow purification on immobilized immunoglobulin, and the domain utilized in the 
FLAGS extension/affinity purification system (Immunex Corp, Seattle, WA). The inclusion 
5 of a cleavable linker sequence such as Factor XA or enterokinase from Invitrogen (San Diego, 
CA) between the polypeptide sequence and the purification domain may be useful for 
recovering the polypeptide. 
Immunoassays. 

The polypeptides including their fragments or derivatives or analogs thereof of the 
1 0 present invention, or cells expressing them, can be in a variety of assays, many of which are 
described herein, for the detection of antibodies. They also can be used as an immunogen to 
produce antibodies. These antibodies can be, for example, polyclonal or monoclonal 
antibodies, chimeric, single chain and humanized antibodies, as well as. Fab fragments, or the 
product of an Fab expression library. Various procedures known in the art may be used for 

1 5 the production of such antibodies and fragments. 

For example, antibodies generated against a polypeptide corresponding to a sequence 
of the present invention can be obtained by direct injection of the polypeptide into an animal or 
by administering the polypeptide to an animal such as a mouse, rabbit, goat or human. A 
mouse, rabbit or goat is preferred. The antibody so obtained then will bind the polypeptide 

2 0 ' itself. In this manner, even a sequence encoding only a fragment of the polypeptide can be 

used to generate antibodies that bind the native polypeptide. Such antibodies can then be used 
to isolate the polypeptide from test samples such as tissue suspected of containing that 
polypeptide. For preparation of monoclonal antibodies, any technique which provides 
antibodies produced by continuous cell line cultures can be used. Examples include the 

2 5 hybridoma technique as described by Kohler and Milstein, Nature 256:495-497 (1975), the 

trioma technique, the human B-cell hybridoma technique as described by Kozbor et al, 
Tmmun. Today 4:72 (1983), and the EBV-hybridoma technique to produce human monoclonal 
antibodies as described by Cole, et al., in Mnnodonal Antibodies and Cancer Therapy , Alan 
R. Liss, Inc, New York, NY, pp. 77-96 (1985). Techniques described for the production of 

3 0 single chain antibodies can be adapted to produce single chain antibodies to immunogenic 

polypeptide products of this invention. Sec, for example, U.S. Pat. No. 4,946,778, which is 
incorporated herein by reference. 

Various assay formats may utilize the antibodies of the present inver : on, including 
"sandwich" immunoassays and probe assays. For example, the monoclonal antibodies or 
3 5 fragment thereof of the present invention can be employed in various assay systems to 

determine the presence, if any, of MMP 19 polypeptide in a test sample. For example, in a first 
assay format, a polyclonal or monoclonal antibody or fragment thereof, or a combination of 
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these antibodies, which has been coated on a solid phase, is contacted with a test sample, to 
form a first mixture. This first mixture is incubated for a time and under conditions sufficient 
to form antigen/antibody complexes. Then, an indicator reagent comprising a monoclonal or a 
polyclonal antibody or a fragment thereof, or a combination of these antibodies, to which a 
5 signal generating compound has been attached, is contacted with the antigen/antibody 

complexes to form a second mixture. This second mixture then is incubated for a time and 
under conditions sufficient to form antibody/antigen/antibody complexes. The presence of a 
MMP1 9 polypeptide antigen present in the test sample and captured on the solid phase, if any, 
is determined by detecting the measurable signal generated by the signal generating compound. 
1 0 The amount of MMP1 9 polypeptide antigen present in the test sample is proportional to the 
signal generated. . 

Or, a polyclonal or monoclonal MMP19 polypeptide antibody or fragment thereof, or a 
combination of these antibodies which is bound to a solid support, the test sample and an . 
indicator reagent comprising a monoclonal or polyclonal antibody or fragments thereof, which 

1 5 specifically binds to MMP 1 9 polypeptide antigen, or a combination of these antibodies to 

which a signal generating compound is attached, are contacted to form a mixture. This mixture 
is incubated for a time and under conditions sufficient to form antibody/antigen/antibody 
complexes. The presence, if any, of MMP 19 polypeptide present in the test sample and 
captured on the solid phase is determined by detecting the measurable signal generated by the 

2 0 signal generating compound. The amount of MMP 19 polypeptide proteins present in the test 

sample is proportional to the signal generated. 

In another assay format, one or a combination of at least two monoclonal antibodies of 
the invention can be employed as a competitive probe for the detection of antibodies to MMP 1 9 
polypeptide protein. For example, MMP19 polypeptide proteins such as the recombinant 

2 5 antigens disclosed herein, either alone or in combination, are coated on a solid phase. A test 

sample suspected of containing antibody to.MMP19 polypeptide antigen then is incubated with 
an indicator reagent comprising a signal generating compound and at least one monoclonal 
antibody of the invention for a time and under conditions sufficient to form antigen/antibody ' 
complexes of either the test sample and indicator reagent bound to the solid phase or the 

3 0 indicator reagent bound to the solid phase. The reduction in binding of the monoclonal 

antibody to the solid phase can be quantitatively measured. 

In yet another detection method, each of the monoclonal or polyclonal antibodies of the 
present invention can be employed in the detection of MMP 19 polypeptide antigens in fixed 
tissue sections, as well as fixed cells by immunohistochemical analysis. CytiK--.rr.ical 
3 5 analysis wherein these antibodies are labeled directly (with, for example, fluorescein, colloidal 
gold, horseradish peroxidase, alkaline phosphatase, etc.) or are labeled by using secondary 
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labeled anti-species antibodies (with various labels as exemplified herein) to track the " 
histo pathology of disease also are within the scope of the present invention. 

In addition, these monoclonal antibodies can be bound to matrices similar to CNBr- 
activated Sepharose and used for the affinity purification of specific MMP19 polypeptide 
5 proteins from cell cultures or biological tissues such as to purify recombinant and native 
MMP 19 polypeptide antigens and proteins. 

The monoclonal antibodies of the invention can also be used for the generation of 
chimeric antibodies for therapeutic use, or other similar applications. 

The monoclonal antibodies or fragments thereof can be provided individually to detect 
1 0 MMP1 9 polypeptide antigens. Combinations of the monoclonal antibodies (and fragments 

thereof) provided herein also may be used together as components in a mixture or "cocktail" of 
at least one MMP19 polypeptide antibody of the invention with antibodies to other MMP 19 . 
polypeptide regions, each having different binding specificities. Thus, this cocktail can 
include the monoclonal antibodies of the invention which are directed to MMP19 polypeptide 

1 5 proteins and other monoclonal antibodies to other antigenic determinants of the MMP19 

polypeptide genome. 

The polyclonal antibody or fragment thereof which can be used in the assay formats 
should specifically bind to a MMP 19 polypeptide region or other MMP19 polypeptide proteins 
used in the assay. The polyclonal antibody used preferably is of mammalian origin; human, 

2 0 goat, rabbit or sheep anti- MMP 19 polypeptide polyclonal antibody can be used. Most 

preferably, the polyclonal antibody is rabbit polyclonal anti-MMP19 polypeptide antibody. 
The polyclonal antibodies used in the assays can be used either alone or as a cocktail of 
polyclonal antibodies. Since the cocktails used' in the assay formats arc comprised of either 
monoclonal antibodies or polyclonal antibodies having different MMP 19 polypeptide 

2 5 specificity, they would be useful for diagnosis, evaluation and prognosis of MMP 19 

polypeptide condition, as well as for studying MMP 19 polypeptide protein differentiation and 
specificity. 

It is contemplated and within the scope of the present invention that the MMP19 
polypeptide may be detectable in assays by use of a recombinant antigen as well as by use of a 

3 0 synthetic peptide or purified peptide, which contains an amino acid sequence of MMP ! 9 

polypeptide. It also is within the scope of the present invention that different synthetic, 
recombinant or purified peptides identifying different epitopes of the MMP 19 polypeptide can 
be used in combination in an assay to diagnose, evaluate, or prognose the disease condition. 
In this case, these peptides can be coated onto one solid phase, or each separate peptide may be ■ 
3 5 coated on separate solid phases, such as microparticles, and then combined to form a mixture 
of peptides which can be later used in assays. Furthermore, it is contemplated that multiple 
peptides which define epitopes from different polypeptides may be used in combination to 
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make a diagnosis, evaluation, or prognosis of disease. Peptides coated on solid phases or. 
labelled with detectable lableS are then allowed to compete with peptides from a patient sample 
for a limited amount of antibody. A reduction in binding of the synthetic, recombinant, or 
purified peptides to the antibody (or antibodies) is an indication of the presence of MMP19 
polypeptides in the patient sample which in turn indicates the presence of disease in the patient. 
Such variations of assay formats are known to those of ordinary skill in the an and are 
discussed herein below. 

In another assay format, the presence of antibody and/or antigen to MMP19 
polypeptide can be detected in a simultaneous assay, as follows. A test sample is 
simultaneously contacted with a capture reagent of a first analyte, wherein said capture reagent 
comprises a first binding member specific for a first analyte attached to a solid phase and a 
capture reagent for a second analyte, wherein said capture reagent comprises a first binding 
member for a second analyte attached to a second solid phase, to thereby form a mixture. This 
mixture is incubated for a time and under conditions sufficient to form capture reagent/first 
analyte and capture reagent/second analyte complexes. These so-formed complexes then are 
contacted with an indicator reagent comprising a member of a binding pair specific for the first 
analyte labeled with a signal generating compound and an indicator reagent comprising a 
member of a binding pair specific for the second analyte labeled with a signal generating • 
compound to form a second mixture. This second mixture is incubated for a time and under 
conditions sufficient to form capture reagent/first analyte/indicator reagent complexes and 
capture reagent/second analyte/indicaior reagent complexes. The presence of one or more 
analytes is determined by detecting a signal generated in connection with the complexes formed 
on either or both solid phases as an indication of the presence of one or more analytes in the 
test sample. In this assay format, recombinant antigens derived from human expression 
systems may be utilized as well as monoclonal antibodies produced from the proteins derived 
from the mammalian expression systems as disclosed herein. Such assay systems are 
described in greater detail in EP Publication No. 0473065. 

In yet other assay formats, the polypeptides disclosed herein may be utilized to detect 
the presence of ami- MMP19 polypeptide in test samples. For example, a test sample is 
incubated with a solid phase to which at least one recombinant protein has been attached. 
These are reacted for a time and under conditions sufficient to form antigen/antibody 
complexes. Following incubation, the antigen/antibody complex is detected. Indicator 
regents may be used to facilitate detection, depending upon the assay system chosen. In 
.i::-.i:her assay format, a test sample is contacted with a solid phase to which a recombinant 
protein produced as described herein is attached and also is contacted with a monoclonal or 
polyclonal antibody specific for the protein, which preferably has been labeled with an 
indicator reagent. After incubation for a time and under conditions sufficient for 
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antibody/antigen complexes to form, the solid phase is separated from the free phase, and the 
label is detected in either the solid or free phase as an indication of the presence of MMP19 
polypeptide antibody. Other assay formats utilizing the recombinant antigens disclosed herein 
are contemplated. These include contacting a test sample with a solid phase to which at least 
one antigen from a first source has been attached, incubating the solid phase and test sample 
for a time and under conditions sufficient to form antigen/antibody complexes, and then 
contacting the solid phase with a labeled antigen, which antigen is derived from a second 
source different from the first source. For example, a recombinant protein derived from a first 
source such as £. col; is used as a capture antigen on a solid phase, a test sample is added to 
the so-prepared solid phase, and a recombinant protein derived from a different source (i.e., 
non-E. coli) is utilized as a part of an indicator reagent. Likewise, combinations of a 
recombinant antigen on a solid phase and synthetic peptide in the indicator phase also are 
possible. Any assay format which utilizes an antigen specific for MMP19 polypeptide from a 
first source as the capture antigen and an antigen specific for MMP19 polypeptide from a 
different second source are contemplated. Thus, various combinations of recombinant 
antigens, as well as the use of synthetic peptides, purified proteins, and the like, are within the 
scope of this invention. Assays such as this and others are described in U.S. Patent No. 
5,254,458, which enjoys common ownership and is incorporated herein by reference. 

Other embodiments which utilize various other solid phases also are contemplated and 
are within the scope of this invention. For example, ion capture procedures for immobilizing 
an immobilizable reaction complex with a negatively charged polymer (described in EP 
publication 0326100 and EP publication No. 0406473), can be employed according to the 
present invention to effect a fast solution-phase immunochemical reaction. An immobilizable 
immune complex is separated from the rest of the reaction mixture by ionic interactions 
between the negatively charged poly-anion/immune complex and the previously treated, 
positively charged porous matrix and detected by using various signal generating systems 
previously described, including those described in chemi luminescent signal measurements as 
described in EPO Publication No. 0 273,1 15. 

Also, the methods of the present invention can be adapted for use in systems which 
utilize microparticle technology including in automated and semi-automated systems wherein 
the solid phase comprises a microparticle (magnetic or non-magnetic). Such systems include 
those described in published EPO applications Nos. EP 0 425 633 and EP 0 424 634, 
respectively. 

The use of scanning probe microscopy (SPM) for immunoassays also is a technology 
to which the monoclonal antibodies of the present invention are easily adaptable. In scanning 
probe microscopy, in particular in atomic force microscopy, the capture phase, for example, at 
least one of the monoclonal antibodies of the invention, is adhered to a solid phase and a 



WO 98/40475 



PCT7US98/04694 



scanning probe microscope is utilized to detect antigen/antibody complexes which may be 
present on the surface of the solid phase. The use of scanning tunneling microscopy 
eliminates the need for labels which normally must be utilized in many immunoassay systems 
to detect antigen/antibody complexes. The use of SPM to monitor specific binding reactions 
5 can occur in many ways. In one embodiment, one member of a specific binding partner 

(analyte specific substance which is the monoclonal antibody of the invention) is attached to a 
. surface suitable for scanning. The attachment of the analyte specific substance may be by 
adsorption to a test piece which comprises a solid phase of a plastic or metal surface, following 
methods known to those of ordinary skill in the art. Or, covalent attachment of a specific 
1 0 binding partner (analyte specific substance) to a test piece which test piece comprises a solid 
phase of derivatized plastic, metal, silicon, or glass may be utilized. Covalent attachment- 
methods are known to those skilled in the art and include a variety of means to irreversibly link 
specific binding partners to the test piece. If the test piece is silicon or glass, the surface must 
be activated prior to attaching the specific binding partner. Also, polyeiectrolyte interactions 

1 5 may be used to immobilize a specific binding partner on a'surface of a test piece by using 

techniques and chemistries. The preferred method of attachment is by covalent means. 
Following attachment of a specific binding member, the surface may be further treated with 
materials such as serum, proteins, or other blocking agents to minimize non-specific binding. 
The surface also may be scanned either at the site of manufacture or point of use to verify its 

2 0 suitability for assay purposes. The scanning process is not anticipated to alter the specific 

binding properties of the test piece. 

While the present invention discloses the preference for the use of solid phases, it is 
contemplated that the reagents such as antibodies, proteins and peptides of the present 
invention can be utilized in non-solid phase assay systems. These assay systems are known to 

2 5 those skilled in the art, and are considered to be within the scope of the present invention. 

It is contemplated that the reagent employed for the assay can be provided in the form 
of a test kit with one or more containers such as vials or bottles, with each container containing 
a separate reagent such as a probe, primer, monoclonal antibody or a cocktail of monoclonal 
antibodies, or a polypeptide (either recombinant or synthetic) employed in the assay. Other 

3 0 components such as buffers, controls, and the like, known to those of ordinary skill in art, 

may be included in such test kits. It also is contemplated to provide test kits which have means 
for collecting test samples comprising accessible body fluids, eg. blood, urine, saliva, and 
stool. Such collection means include lancets and absorbent paper or cloth for coi 1 -' Jting and 
stabilizing blood; swabs for collecting and stabilizing saliva; cups for collecting and stabilizing 
3 5 urine or stool samples. Collection materials, papers, cloths, swabs, cups and the like, may 
optionally be treated to avoid denaturation or irreversible adsorption of the sample. The 
collection materials also may be treated with or contain preservatives, stabilizers or 
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antimicrobial agents to help maintain the integrity of the specimens. Test kits designed for the 
collection, stabilization, and preservation of test specimens obtained by surgery or needle 
biopsy are also useful. It is contemplated that all kits may be configured in two components; 
one component for collection and transport of the specimen, and the other component for the 
5 analysis of the specimen. Further, kits for the collection, stabilization, and preservation of test 
specimens may be configured for use by untrained personnel and may be available in the open 
market for use at home with subsequent transportation to a laboratory for analysis of the test 
sample. 

E. coli bacteria (clone #907334) has been deposited at the American Type Culture 
1 0 Collection (A.T.C.C.), 12301 Parklawn Drive, Rockville, Maryland 20852, as of March 10, 
1997, under the terms of the Budapest Treaty and will be maintained for a period of thirty (30) 
years from the date of deposit, or for five (5) years after the last request for the deposit, or for 
the enforceable period of the U.S. patent, whichever is longer. The deposit and any other 
deposited material described herein are provided for convenience only, and are not required to 

1 5 practice the present invention in view of the teachings provided herein. The cDNA sequence in 

all of the deposited material is incorporated herein by reference. Clone #907334 was accorded 

A.T.C.C. Deposit No . 

Having now generally described the invention, a complete understanding can be 
obtained by reference to the following specific examples. The following examples are given 

2 0 for th- purpose of illustrating various embodiments of the invention and are not meant to limit 

the present invention in any fashion. 

Example 1 : Isolmion of Human cDNA Clones Homol ogous to MMPs 

2 5 A consensus of twelve human matrix metalloprotease amino acid sequences was 

obtained by aligning sequences using the PILEUP program in the Wisconsin Sequence 
Analysis Package (Genetics Computer Group [GCG], Madison, WI). The consensus 
sequence was derived from a plurality of seven sequences at a given amino acid position. The 
letter "x" was substituted for each amino acid that differed from the determined consensus 

3 0 residue and was used to represent any amino acid. 

The MMP consensus sequence (SEQ ID NO:7, sec FIG. 1) was used to search the 
LifeSeq™ human expression database (Incyte Pharmaceuticals, Inc., Palo Alto, CA) for 
human MMP sequences. This database is comprised of partial sequences of cDNA clone 
inserts, so-called expressed sequence tags (ESTs), derived ironi cDNA libraries from various 
3 5 human tissues. A search of LIFESEQ™ using the BLAST and Smith- Waterman similarity 

search algorithms with the MMP consensus sequence identified an EST (from clone #907334) 
unique to the LIFESEQ™ database and whose deduced amino acid sequence of one reading 
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frame contained the amino acid sequence PRCGVTD (SEQ ID NO:8), similar to the "cysteine 
switch" motif in the propeptide region of all MMPs. The EST from clone #907334 was 
derived from a colon tissue library using oligo-dT for the reverse transcription reaction. The 
cDNA clone #907334 in pSPORTl plasmid (GIBCO BRL. Gaithersburg, MD) was obtained 
5 for further study. 

Clone #907334 was amplified in bacteria and digested with the restriction enzymes 
EcoRI and Notl to determine the size of the cDNA insert. Insert length was determined to be 
approximately 1900 base pairs. The complete nucleotide sequence of the clone (shown in 
FIG. 4) was obtained by initially sequencing with plasmid primers and subsequently with 

1 0 primers near the end of each newly obtained sequence. All sequence information from each 

reaction was compiled and analyzed using the Sequencher™ program (Gene Codes 
Corporation, Ann Arbor, MI). Sequencing was continued until both strands of the cloned 
cDNA insert were sequenced in entirety. In FIG. 4, the sense strand (SEQ ID NO: 1 ) is shown 
on top with its complement (SEQ ID NO:2) aligned directly beneath. 
15 *■ •■■ The deduced amino acid sequence of clone #907334 (SEQ ID NO: 10, also shown in 

FIG. 4) encoded a polypeptide of at least 470 amino acids but lacked an initiating methionine 
start codon and a portion of the putative pro-peptide domain. The amino acid sequence was 
compared to those of other MMPs (FIG. 5) and shown to contain a cysteine switch motif 
(SEQ ID NO:8) within the putative pro-peptide domain which was 86% homologous to the 

2 0 consensus cysteine switch motif of the other MMPs (PRCGVPD, SEQ ID NO: II). In 

addition, MMPI9 contained an identical version (100%) of the highly conserved zinc binding 
consensus sequence (HEXaaGHXaaLGLXaaHS, SEQ ID NO: 12) found in the putative 
catalytic domain. The sequence also contained a potential recognition and cleavage site 
(RXKR, SEQ ID NO:33) for Kex2-like proteases shown to be involved in enzyme activation 

2 5 of stromelysin 3 and the membrane-type MMPs. 

Example 2: Localization of Clone #907334 Transcripts 

A. Nothem Blotting : The entire nucleotide sequence of clone #907334 (hereafter 

3 0 referred to as MMP19) was compared to LIFESEQ™ and other databases (for example 

GenBank, Wash. U./Merck EST) using the BLAST and Smith-Waterman programs. Public 
databases did not contain sequences identical to MMP19 cDNA. LIFESEQ™ contained eight 
additional ESTs identical to some portion of MMP19 cDNA (FIG. 6). The nine ESTs in 
LIFESEQ™ which were identical to MMPi9 DNA had been sequenced from cDNA libraries 
3 5 derived from 8 different tissues, including colon, lung, prostate, spinal cord, ovary and 

prostate tumor, brain tumor and ganglioneuroma (FIG. 6) Thus, these tissues expressed at 
least small amounts of the mRNA encoded by MMP19 cDNA. 
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In order to determine whether MMP19 mRNA sequences could be detected by 

32 

hybridization, the entire 1717 bp fragment of MMP19 cDNA was radiolabeled with P using 
a commercial random primer labeling kit (Pharmacia, Piscataway, NJ). Specific activity of the 
labeled fragment was determined to be -5 x 10 8 cpm/|ig DNA. The labeled cDNA was used 
5 as a probe to hybridize to mRNA (2 \ig polyA+ RNA) on Northern blots (specifically, 

Multiple Tissue Northerns 1 and 2, obtained from Clontech, Palo Alto, CA) which contained 
various human tissue RNA. The blots were prehybridized at 60°C for 1 hour in Express Hyb 
solution (supplied with the blots) and hybridized (also in Express Hyb solution) at the same 
temperature for two hours in the presence of denatured probe at ! x 1 0 cprn/mL. After 
1 0 washing the blots once in 2X SSPE + 0.1% SDS (20 min), and twice under stringent 

conditions (0.2X SSPE + 0. 1 % SDS, 50°C, 20 min. each wash), the filters were exposed and 
analyzed on a Phosphorimager (Molecular Dynamics, Sunnyvale, CA). By Northern blotting, 
it was determined that MMPI9 mRNA transcript is approximately 2.4 kb and is present in 
human testes and to a lesser extent in human colon, ovary, small intestine, and prostate tissue 

1 5 (FIG. 1 1 ). Smaller less abundant transcripts with similar tissue distribution were also 

observed. 

B. Ribonuclease Protection Assay : Alternatively, instead of or in addition to 
performing a Northern blot, a ribonuclease protection assay may be performed to determine 
whether MMP19 is present in particular tissues. A ribonuclease protection assay is performed 

2 0 as follows: 

1. Labeling of Complementary RNA fcRNA) Hybridization Probes . Labeled 
sense and antisense riboprobes are transcribed from the EST sequence which contains an RNA 
polymerase promoter such as SP6 or T7. The sequence may be from a vector containing the 
appropriate EST insert or from a PCR-generated product of the insert using PCR primers 

2 5 which incorporate am RNA polymerase promoter sequence. The transcripts are prepared in a 

20 |iL reaction volume containing 1 p.g of DNA template, 2 |iL of 100 mM dithiothreitol, 0.8 
pX of RNasin (10-40U), 500 pJvl each of ATP, CTP, GTP, 5 pL (alpha 32 P) UTP or 100-500 
(iM biotinylated UTP, and 1 uX of RNA polymerase in transcription buffer (40 mM Tris-HCl, 
pH 7.5, 6 mM MgCl2, 2 mM spermidine HC1, 5 mM NaCl). Following incubation at 37~C 

3 0 for one hour, the transcripts are treated with DNase I (15 U) for an additional 30 min to digest 

the template. The probes then are isolated by spin columns, salt precipitation or 
electrophoresis techniques which are well-known in the art. Finally, the probes are dissolved 
in lysis bufkv (5 M Guanidine Thiocyanate, 0.1 M EDTA, pH 7.0). 

2. Hybridization nf 1 aheled Pro be to Target. Approximately 20 \ig of 

3 5 extracted total cellular RNA, prepared as described in Sambrook, et al supra, is placed in 10 
uX of lysis buffer and mixed with either (i) 1 x 10 5 cpm of radioactivcly labeled probe or (ii) 
250 pg of non-isotopicatly labeled probe, each in 2 uX of lysis buffer. The mixture then is 
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incubated at 60°°C for 5 min and hybridized overnight at room temperature. See, T. Kaabache 
et al.,Anai Biochem. 232: 225-230 (1995). 

3. RNase Digestion - Hybridizations are terminated by incubation with 380 fiL 
of a solution containing 40 (Xg/mL RNase A and 625 U/mJL RNase Tl in 1 mM EDTA, 300 

5 mM NaCI, 30 mM Tris-HCl pH 7.4 for 45-60 min at room temperature. RNase digestion then 
is terminated by the addition of 60 |iL of proteinase-K (1 .7 mg/mL) containing 3.3% SDS, 
followed by incubation for 30 min at 37°C. The digested mixture then is extracted with 
phenol:chloroform:isoamyl alcohol to remove protein. The mRNA:cRNA hybrids are 
precipitated from the aqueous phase by the addition 4 }ig yeast tRNA and 800 p.L of ethanol, 

1 0 and incubation at -80°C for 30 min. The precipitates are collected by centrifugation. 

4. Fragment Analysis . The precipitates are dissolved in 5 \iL of denaturing gel 
loading dye (80% formamide, 10 mM EDTA, pH 8.0, 1 mg/mL xylene cyanol, 1 mg/mL 
bromophenol blue) and electrophoresed in 6 % polyacrylamide TBE, 8 M urea denaturing 
gels. The gels are dried under vacuum and autoradiographed. Quantitation can be performed 

15 - by comparing the counts obtained from the test samples to a calibration curve that was 

generated by utilizing calibrators that are the sense strand. In cases where non-isotopic labels 
are used, hybrids are transferred from the gels to membranes (nylon or nitrocellulose) by 
blotting and then analyzed using detection systems that employ streptavidin alkaline 
phosphatase conjugates and chemiluminesence or chemifluoresence reagents. Again, 
_ 2 0 expression of an mRNA which is detectable by the labeled probe in a particular tissue suggests 
that MMPI9 is produced in that tissue. 

C. Dot Blot/Slot Blot : Dot and slot blot assays are also quick methods to evaluate the 
presence of a specific nucleic acid sequence in a complex mix of nucleic acid. To perform, up 
to 20 jig of RNA is mixed in 50 uX of 50% formamide, 7% formaldehyde, IX SSC, incubate 

2 5 15 min at 68°C and cool on ice. Then, 100 uX of 20X SSC is added to the RNA mixture and 

loaded under vacuum onto a manifold apparatus that has a prepared nitrocellulose or nylon 
membrane. The membrane is soaked in water, 20X SSC for I hour, placed on two sheets of 
20X SSC prewet Whatman #3 filter paper, and loaded into a slot blot or dot blot vacuum 
manifold apparatus. The slot blot is analyzed with probes prepared and labeled as described 

3 0 supra. Other methods and buffers not specifically detailed here are described in J. Sambrook 

etal., supra. 

D. In Situ Hybridization : This method is useful to directly detect specific target 
nucleic acid sequences in cells using detectable nucleic acid hybridization probes. 

Tissues are prepared with cross-linking fixatives agents such as paraformaldehyde or 
3 5 glutaraldehyde for maximum cellular RNA retention. See, L. Angerer et ai t Methods in Cell 
Biol. 35: 37-7! (1991). Briefly, the tissue is placed in greater than 5 volumes of 1% 
glutaraldehyde in 50 mM sodium phosphate, pH 7.5 at 4°C for 30 min. The solution is 
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changed with fresh solution for a further 30 min fixing. The fixing solution should have an 
osmolality of approximately 0.375% NaCl. The tissue is washed once in isotonic NaCl to 
remove the phosphate. 

The fixed tissues then are embedded in paraffin, as follows. The tissue is dehydrated 
through a series of ethanol concentrations for 15 min each: 50% twice, 70% twice, 85%, 90% 
and 100% twice. The tissue next is soaked in two changes of xylene for 20 min each at room 
temperature; then it is soaked in two changes of 1 xylene: 1 paraffin for 20 min each at 60°C; 
and then it is soaked in three final changes in paraffin for 15 min each. 

The tissue next is cut in 5 fim sections using a standard microtome and placed on a 
slide previously treated with the tissue adhesive 3-aminopropyltriethoxysilane. 

Paraffin is removed from the tissue by two 10 min xylene soaks and rehydrated in a 
series of ethanol concentrations; 99% twice, 95%, 85%, 70%, 50%, 30% and distilled water 
twice. The sections are pre-treated with 0.2 M HC1 for 10 min and permeabilized with 2 
u.g/mL Proteinase-K at 37°C for 1 5 min. 

Labeled riboprobes transcribed from the pSPORTl plasmid containing fragments of 
MMP19 cDNA are hybridized to the prepared tissue sections and hybridized overnight at 
56~C in 3X standard saline extract and 50% formamide. Excess probe is removed by 
washing in 2X standard saline citrate and 50% formamide followed by digestion with 100 
Hg/mL RNase A at 37o°C for 30 min. Fluorescence probe is visualized by illumination with 
UV light under a microscope. Fluorescence in the cytoplasm is indicative of mRNA 
production. Fluorescence in the nucleus detects the presence of genomic material. 
Alternatively, the sections can be visualized by autoradiography. 

fixnmple 3: Expression of MMP19 Sequences 

A. Construction of Expression V e ^nrs Hontaininr ON A Fragments Encoding the 
rntalvtic Dom ain s nf MMP1Q Protein : The MMP 19 plasmid was used as template in PCR 
reactions to generate DN A fragments encoding MMP19 protein for introduction into two 
cukaryotic expression vectors, pcDNA3.1 (Invitrogen, San Diego, CA) and pCINeo 
(Promega, Madison, WI). The two upstream primers used in PCR reactions, SEQ ID NO:3 
(5'-GTATCTCTAGACACCATGTTTGCAAAGCAAGGTAACAAATGGTACAAGC-3') and 
SEQ ID NO:4 (S'-GTATCTCTAGACACCATGAAGGTTCTGTGGGCTGCGTTGCTGGT- 
CACATTCCTGGCAGGATGCCAGGCCTTTGCAAAGCAAGGTAACAAATGG-3*) 
contained sequences to create an XbaJ restriction enzyme site, a Kozak consensus sequence for 
translation initiation, an ATG start codon, and a sequence of nucleotides from nucleotide 
position 2 1 4 to nucleotide position 244 of SEQ ID NO: 1, which corresponded to the N- 
terminus of the putative catalytic domain of the enzyme. SEQ ID NO:4 also contained a 



WO 98/40475 



PCT/US98/04694 



sequence which encodes a signal peptide (from the secreted protein, apoiipoprotein E), for 
transport of the translated protein into the secretory pathway. The signal peptide sequence was 
placed upstream from and in frame with the sequence encoding the N-terminus of the putative 
catalytic domain. Downstream primers used in PCR reactions were either SEQ ID NO:5 (5'- 
5 GTACTTCTAGACTACTTGTCAT-CGTCGTCCTTGTAGTCACCACCGAACAGGGCG- 
CTCCCCGAGTTGGCATGCC^) which encodes an epitope recognized by anti-FLAG M2, 
(available from Sigma Biosciences, St. Louis, MO) or SEQ ID NO:6 (5 * -GT ACTTCT AG A- 
GATCTTCTTCACTGATCAGCTTCTGTTCACCACCGAACAGGGCGCTCCCCGAGTTG 
-GCATGCC-3') which encodes an epitope recognized by anti-c-Myc (Santa Cruz 
1 0 Biotechnology, Santa Cruz, CA). In both downstream primers, these epitope sequences were 
in frame with the last codon in the open reading frame of the cDNA and were followed by a 
stop codon (TGA) and an Xbal restriction enzyme site. 

PCR amplifications were performed using primers sets SEQ ID NOS:3 and 5. SEQ ID 
NOS:3 and 6, SEQ ID NOS:4 and 5 and SEQ ID NOS:4 and 6 under standard PCR 

1 5 conditions, i.e. in a iota! reaction volume of 50 jlL containing 200 jiM of each dNTP wherein 

N was A, T, G and C, 1 u,M of each primer, -50 ng template DNA and 1 unit of A mpliTaq 
DNA polymerase (Perkin Elmer, Norwalk, CT). Amplifications were performed for a total of 
35 cycles (1 cycle = 94°C for 1 min, 60°C for 1 min, and 72°C for 2 min). After 
amplification. PCR products were digested with Xbal, gel purified, and ligated into the Xbal 

2 0 site of pCDNA3. 1 and pCENeo. After transformation of the ligated DNA into appropriate 

bacteria] hosts (for example DH5a), plasmid DNA was prepared from individual clones and 
subjected to restriction enzyme and sequence analysis to identify clones that contained MMP19 
DNA with the correct sequence and in the proper orientation. 

B. Transfection of MMP19 DNA Expression Vectors into HEK293 Cells: Expression 

2 5 of MMP19 constructs in HEK293 cells is achieved by transfection using a modified calcium- 

phosphate procedure (Chen and Okayama, Mol. Cell. Biol. 7: 2745-2752, 1987). HEK-293 
cells (ATCC CRL1573) are grown in minimal essential medium (MEM) containing 10% fetal 
bovine serum (FBS) and antibiotic/ami mycotic solution at 37° C in 5% CO2, 95% O2. Cells 
are transfected with purified plasmid MMP19 DNA (20 u.g/2.5 X 10 6 cells/10 cm plate). For 

3 0 stable transfections, cells are grown in the presence of G418 and MEM/FBS. For both 

transient and stable transfections, serum-free media is harvested 3 days after medium 
replacement. 

C. Detec tion of MMP 19 protein ^Western blot): Media from transfected cells is mixed 
(vol: vol, 1 : 1) with 2X denaturing buffer (10 mM Tris-HCl, pH6.8, 4% SDS, 20% glycerol, 

3 5 1 % B-mercaptoethanol, 0.02% bromphenol blue) and denatured by heating 5 minutes, 95°C. 
Altquots (20u.I) of each sample are electrophorescd for -1 hour on a 10-20% Tris-tricine 
minigc! (10 X 10 cm) at -50 mA constant current. Gels are transferred to PVDF membranes 



WO 98/40475 



PCT/US98/04694 



51 

by electroblotting at 200 rnA for 1.5 hours. Non-specific binding sites on PVDF membranes 
are blocked for 1 hour at room temperature with 5% nonfat dry milk in Tris-buffered saline 
containing Tween-20 (TBS-Tween) (20 mM Tris, pH 7.6, 140 mM NaCl, 0.1% Tween-20). 
Blots are then incubated for 1 hour at room temperature in TBS-Twecn/5% milk containing an 
5 appropriate amount of primary antibody (either anti-FLAG or anti-c-Myc). Blots are washed 
in 3 changes of TBS-Tween for a total of 45 minutes at room temperature before incubating 
with the detection antibody (1: 1000 dilution) for 1 hour at room temperature in TBS- 
Tween/5% milk. Blots are again washed in 3 changes of TBS-Tween for 45 minutes at room 
temperature. Protein bands are visualized by autoradiography using ECL Western blotting 
1 0 detection reagents (Amersham, Arlington Heights, 1L). 

Example 4: Assays of Enzymatic Activity of MMP19 

Purification of the MMP1 9 protein containing the FLAG sequence is performed by 

1 5 immunoaffinity chromatography using an affinity matrix comprising anti-FLAG M2 

monoclonal antibody covalently attached to agarose by hydrazide linkage (Eastman Kodak 
Co., New Haven, CT). Prior to affinity purification, medium from transfected HEK293 cells 
is exchanged into 50 mM Tris-HCl pH 7.5, 1 50 mM NaCi buffer using a Sephadex G-25 
(Pharmacia Biotech Inc., Uppsala, Sweden) column. Protein in this buffer is applied to the 

2 0 anti-FLAG M2 antibody affinity column, non-binding protein is eluted by washing the column 

with 50 mM Tris-HCl pH 7.5, 150 mM NaCl buffer, and bound protein is eluted using an 
excess of FLAG peptide in 50 mM Tris-HCl, pH 7.5, 150 mM NaCl. The excess FLAG 
peptide is removed by size exclusion chromatography and column fractions containing 
MMP19 protein are dialyzed in 50 mM Tris-HCl, pH 7.5, 150 mM NaCl, 10 MM CaCl. 

2 5 Assays to determine enzyme activity are performed using a fluorogenic peptide 

substrate Gly-Glu(EDANS)-Gly-Pro-Leu-Gly-Leu-Tyr-Ala-Lys(DABCYL)-Gly (SEQ ID 
NO: 1 3). SEQ ID NO: 1 3 was synthesized and purified acording the the procedure of E.D. 
Matayoshi et al„ Science, 247: 954-958, 1990). Hydrolysis of the Gly-Leu bond' in SEQ ID 
NO: 13 results in a 40-fold increase in fluorescence. 

3 0 Purified MMP19 protein from transfected HEK293 cell medium is serially diluted and 

incubated in microtiter plate wells with 100 U.M fluorogenic peptide substrate, 50 mM Tris- 
HCl, pH 7.5, 150 mM NaCl in a final volume of 150 p.L. Assay mixtures are incubated at 
room temperature and the progress of the reaction monitored for up to 1 hour in an Titcrtek 
Fluoroskan II instrument (ICN Biomedicals, Huntsville, AL) with an excitation filter set at 335 
3 5 nm and emission filter at 485 nm. Background fluorescence is determined using identical 

reactions in the absence of MMP19. Data is collected online with a Macintosh computer using 
DELTA SOFT II. version 4.0 (BioMetallics, Inc., Princeton, NJ). Nonlinear curve fitting is 
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performed using KaleidaGraph (Synergy Software, Reading, PA). Compounds which 
potentially inhibit MMP19 activity are screened by including serial dilutions of each compound 
with the above reaction and comparing the MMP 19 enzymatic activity in the presence and 
absence of compound. 

Example 5: Proteolysis of pro-MMPs bv MMP19 

Interaction of MMP19 with other MMPs is determined by incubation of purified active 
MMP19 with the zymogen (pro-peptide) forms of Gelatinase-A (72 kD) [MMP-2] and 
Gelatinase-B (92 kD) [MMP-9]. Gelaiinase-A and Gclatinase-B are isolated from HT-1080 
cell cultures stimulated with TNFa as described in V. Antibiotics 45: 1733-1737, 1992 and 
Biochem. J. 285: 603-61 1, 1992. After 1 hour incubation at 37°C in buffer containing 50 mM 
Tris-HCl, pH 7.5, 150 mM NaCl and approximately 250 ng of each enzyme, aliquots are 
removed and electrophoresed on 10-20% Tris-Tricine gels. Western blots of gels are 
performed using antibodies to Gelatinse A and Gelatinase B (Oncogene Science, Cambridge, 
MA) as described above and proteolysis is indicated by the presence of bands of lower 
molecular weight relative to the pro-enzyme forms of Gelatinase A and Gelatinase B. 

Example 6: Production of Polyclonal Ant ibodies to MMP19 

Synthetic peptides (SEQUENCE ID NOS: 14-17, see Table 1 below) are prepared 
based upon the predicted amino acid sequence of the MMP 19 polypeptide. 



Table 1 




Sequence 


SEQ ID NO: 


RHRTKMRRKKRFAKQGN 


14 


FQGDHN DGLGN AFDG 


15 


RSLQDWGGIPEEVSGALPRPDGSH 


16 


ATELPWMGCWHANSGSALF 


17 



Peptides are synthesized on an ABI Peptide Synthesizer (Applied Biosy stems, Foster 
City, CA), using standard reagents and conditions known in the art for solid phase peptide 
synthesis (see for example, Stewart, J.M., and Young, DJ.. Solid Phase Peptide Synthesis, 
W.H. Freeman Co., San Francisco, 1963). Cleavage of the peptide from the resin and final 
deprotection of the peptide are achieved by adding the resin to 20 ml mfluoroacetic acid (TFA), 
0.3 ml water, 0.2 ml ethanedithiol, 0.2 ml thioanisole and 100 mg phenol, and stirring at room 
temperature for 1 .5 hours. The resin then is filtered by suction and the peptide obtained by 
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precipitation of the TFA solution with ether, followed by filtration. Each peptide is purified 
via reverse-phase preparative HPLC using a water/acetonitrile/O. i% TFA gradient and 
lyophilized. The product is confirmed by mass spectrometry. 

-To generate antigens for immunization, the purified peptides arc conjugated to Keyhole 
5 Limpet Hemocyanin (KLH) and bovine serum albumin (BSA) using an Imject Activated 

Immunogen Conjugation Kit (Pierce, Rockford, IL) in accordance with the manufacturer's 
instructions. 

Polyclonal antisera are generated using the protocol of the Berkeley Antibody 
Company (Berkeley, CA). Before receiving the first immunization, a sample of preimmune 
1 0 blood (5 ml) is drawn from each of at least 2 rabbits. Afterward, each rabbit is injected 
subcutaneously with an aliquot of KLH-conjugated peptide (200-500 mg) in Complete 
Freunds Adjuvant. After 21 days, the immune response is boosted with a second 
subcutaneous injection of KLH-conjugated peptide (100-250 mg) in Incomplete Freund's 
Adjuvant. Blood (50 ml) is collected on day 3 1 and serum tested for reactivity to BSA- 

1 5 coupled peptide using an enzyme linked immunoadsorbam assay (ELIS A). Subsequent boosts 

with KLH-conjugated peptide are given on days 42, 63 and 84 (post injection #1) and 
production bleeds (50 ml) drawn on days 52, 73 and 94 for testing by ELISA to determine 
antibody titer. Serum is then stored at -20°C until further use. 

2 0 Example 7: Isolation of the 5 1 end of MMP19 cDNA 

CapFinder cDNA synthesis kit (Clontech, Palo Alto, CA) was used to amplify DNA at 
the 5' end of MMPI9 cDNA. Briefly, reverse transcription reactions using human testes 
mRNA (Clontech, Palo Alto, CA) were performed us specified by the manufacturer. The 

2 5 cDNA was amplified by PCR using an oligonucleotide primer complementary to MMPI9 

cDNA (nucleotides 704-678 of SEQ ID NO: 1) and the 5' CapSwitch oligonucleotide. PCR 
amplifications were performed in a total reaction volume of 100 uX containing 200 u.M of each 
dNTP wherein N was A, T, G and C, I U.M of each primer, 20 |iL of cDNA from the reverse 
transcription reaction, and 1 unit of AmpliTaq DNA polymerase (Perkin Elmer, Norwalk, 

3 0 CT). Amplifications were performed for a total of 35 cycles ( 1 cycle = 94°C for 1 min, 60°C 

for 1 min, and 72°C for 2 min). Two PCR reaction products (1.3 kb and 1.0 kb) identified by 
agarose gel electrophoresis and ethidium bromide staining were large enough to contain 
' sequences upstream of the 5' end of the original sequence. These fragments were purified and 
suncloned into pCR-Script (Stratagene, La Jolla, CA) in accordance with the protocol provided 
3 5 by the supplier. 

Clones are analyzed for sequences encoding the putative 5 ! end of MMP19 mRNA 
using an oligonucleotide complementary to MMP19 cDNA (nucleotides 704-678 of SEQ ID 
NO: 1) in standard nucleotide sequencing reactions. Clones are verified as authentic MMP19 
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cDNA by PCR amplification of predicted size bands using testes mRNA as template and 
primers within both the new sequence and complementary to the original MMP19 cDNA 
sequence (nucleotides 655-626 of SEQ ED NO:l). 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: Falduto, Michael 

Magnuson, Scott R. 
Morgan, Douglas w. Morgan 

<ii) TITLE OF THE INVENTION : HUMAN MATRIX METALLO PROTEASE , 
PROTEINS ENCODED THEREFROM AND USES THEREOF 



{iii> NUMBER OF SEQUENCES: 33 

(iv) CORRESPONDENCE ADDRESS: 

{A) ADDRESSEE: Abbott Laboratories 

(B) STREET: 100 Abbott Park Road 

(C) CITY: Abbott Park 
ID) STATE: IL 

(E) COUNTRY: USA 

(F) ZIP : 60064-3500 

(v) COMPUTER READABLE FORM : 

(A) MEDIUM TYPE : Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: ll-MAR-1997^ 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Casuto, Dianne 

(B) REGISTRATION NUMBER: P-40,943 

(C) REFERENCE/DOCKET NUMBER: 6073. US. 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (847) 938-3137 

(B) TELEFAX: {B47) 538-2623 

(C) TELEX: 



(2> INFORMATION FOR SEQ ID NO : 1 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
ID) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



SO 
120 

ieo 

240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 



AAAGCTCCCA CCTCCACTCG ATTCAGCGAT GCCATCAGAG CGTTTCAGTG GGTGTCCCAG 
CTACCTGTCA GCGGCGTGTT GGACCGCGCC AACCTGCGCC AGATGACTCG TCCCCGCTGC 
GGGGTTACAG ATACCAACAG TTATGCGGCC TGGGCTGAGA GGATCAGTGA CTTGTTTGCT 
AGACACCGGA CCAAAATGAG GCGTAAGAAA CGCTTTGCAA AGCAAGGTAA CAAATGGTAC 
AAGCAGCACC TCTCCTACCG CCTGGTGAAC TGGCCTGAGC ATCTGCCGGA GCCGGCAGTT 
CGGGGCGCCG TGCGCGCCGC CTTCCAGTTG TGGAGCAACG TCTCAGCGCT GGAGTTCTGG 
GAGGCCCCAG CCACAGGCCC CGCTGACATC CGGCTCACCT TCTTCCAAGG GGACCACAAC 
GATGGGCTGG GCAATGCCTT TGATGGCCCA GGGGGCGCCC TGGCGCACGC CTTCCTGCCC 
CGCCGCGGCG AAGCGCACTT CGACCAAGAT GAGCGCTGGT CCCTGAGCCG CCGCCGCGCG 
CGCAACCTGT TCGTGGTGCT GGCGCACGAG ATCGGTCACA CGCTTGGCCT CACCCACTCG 
CCCGCGCCGC GCGCGCTCAT GGCGCCCTAC TACAAGAGGC TGGGCCGCGA CGCGCTGCTC 
AGCTGGGACG ACGTGCTGGC CGTGCAGAGC CTGTATGGGA AGCCCCTAGG GGGCTCAGTG 
GCCGTCCAGC TCCCAGGAAA GCTGTTCACT GACTTTGAGA CCTGGGACTC CTACAGCCCC 
CAAGGAAGGC GCCCTGAAAC GCAGGGCCCT AAATACTGCC ACTCTTCCTT CGATGCCATC 
ACTGTAGACA GGCAACAGCA ACTGTACATT TTTAAAGGGA GCCATTTCTG GGAGGTGGCA 
GCTGATGGCA ACGTCTCAGA GCCCCGTCCA CTGCAGGAAA GATGGGTCGG GCTGCCCCCC 
AACATTGAGG CTGCGGCAGT GTCATTGAAT GATGGAGATT TCTACTTCTT CAAAGGGGGT 
CGATGCTGGA GGTTCCGGGG CCCCAAGCCA- GTGTGGGGTC TCCCACAGCT GTGCCGGGCA 
GGGGGCCTGC CCCGCCATCC TGACGCCGCC CTCTTCTTCC CTCCTCTGCG CCGCCTCATC 114 0 
CTCTTCAAGG GTGCCCGCTA CTACGTGCTG GCCCGAGGGG GACTGCAAGT GGAGCCCTAC 
TACCCCCGAA GTCTCCAGGA CTGGGGAGGC ATCCCTCAGG AGGTCAGCGG CGCCCTGCCG 
AGGCCCGATG GCTCCATCAT CTTCTTCCGA GATGACCGCT ACTGGCGCCT CGACCAGGCC 
AAACTGCAGG CAACCACCTC GGGCCGCTGG GCCACCGAGC TGCCCTGGAT GGGCTGCTGG 
CATGCCAACT CGGGGAGCGC CCTGTTCTGA AGGCACCTCC TCACCTCAGA AACTGGTGGT 144 0 
GCTCTCAGGG CAAAATCATG TTCCCCACCC CCGGGGCAGA ACCCCTCTTA GAAGCCTCTG i 
AGTCCCTCTG CAGAAGACCG GGCAGCAAAG CCTCCATCTG GAAGTCTGTC TGCCTTTGTT 
CCTTGAAGAA TGCAGCATTG TCTTTGTCTG TCCCCACCAC ATGGAGGTGG GGGTGGGATC 
AATCTTAGGA AAAGCAAAAA AGGGTCCCAG ATCCCTTGGC CCTTTCCTCC GAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAA 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1717 base, pairs 
{ B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



1200 
1260 
1320 
13B0 



1500 
1560 
1620 
1680 
1717 



(xi) SEQUENCE DESCRIPTION : 

TTTCGAGGGT GGAGGTGAGC TAAGTCGCTA 
GATGGACAGT CGCCGCACAA CCTGGCGCGG 
CCCCAATGTC TATGGTTGTC AATACGCCGG 



SEQ ID NO: 2: 

CGGTAGTCTC GCAAAGTCAC CCACAGGGTC 
TTGGACGCGG TCTACTGAGC AGGGGCGACG 
ACCCGACTCT CCTAGTCACT GAACAAACGA 
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TCTGTGGCCT GGTTTTACTC CGCATTCTTT GCGAAACGTT TCGTTCCATT GTTTACCATG 240 
TTCGTCGTGG AGAGGATGGC GGACCACTTG ACCGGACTCG TAGACGGCCT CGGCCGTCAA 3 00 
GCCCCGCGGC ACGCGCGGCG GAAGGTCAAC ACCTCGTTGC AGAGTCGCGA CCTCAAGACC 360 
CTCCGGGGTC GGTGTCCGGG GCGACTGTAG GCCGAGTGGA AGAAGGTTCC CCTGGTGTTG 420 
CTACCCGACC CGTTACGGAA ACTACCGGGT CCCCCGCGGG ACCGCGTGCG GAAGGACGGG 4 BO 
GCGGCGCCGC TTCGCGTGAA GCTGGTTCTA CTCGCGACCA GGGACTCGGC GGCGGCGCCC 54 0 
GCGTTGGACA AGCACCACGA CCGCGTGCTC TAGCCAGTGT GCGAACCGGA GTGGGTGAGC ^ 
GGGCGCGGCG CGCGCGAGTA CCGCGGGATG ATGTTCTCCG ACCCGGCGCT GCGCGACGAG 
TCGACCCTGC TGCACGACCG GCACGTCTCG GACATACCCT TCGGGGATCC CCCGAGTCAC 
CGGCAGGTCG AGGGTCCTTT CGACAAGTGA CTGAAACTCT GGACCCTGAG GATGTCGGGG 
GTTCCTTCCG CGGGACTTTG CGTCCCGGGA TTTATGACGG TGAGAAGGAA GCTACGGTAG 
TGACATCTGT CCGTTGTCGT TGACATGTAA AAATTTCCCT CGGTAAAGAC CCTCCACCGT 
CGACTACCGT TGCAGAGTCT CGGGGCAGGT GACGTCCTTT CTACCCAGCC CGACGGGGGG 
SSSJcTCC GACGCCGTCA CAGTAACTTA CTACCTCTAA AGATGAAGAA GTTTCCCCCA 
SI^ACCT CCAAGGCCCC GGGGTTCGGT CACACCCCAG AGGGTGTCGA CACGGCCCGT 
CC^CCGGACG GGG^GGTAGG ACTGCGGCGG GAGAAGAAGG GAGGAGACGC GGCGGAGTAG 

gagaIotcc cacgggcgat GATGCACGAC CGGGCTCCCC ctgacgttca cctcgggatg 

ATGGGGGCTT SgACGTCCT GACCCCTCCG TAGGGACTCC TCCAGTCGCC GCGCGACGGC 

tccggg^ cgaggtagta gaagaaggct ctactggcga tgaccgcgga cctggtcccg 

TTTGACGTCC GTTGGTGGAG CCCGGCGACC CGGTGGCTCG ACGGGACCTA CCCGACGACC 
GTACGGTTGA GCCCCTCGCG GGACAAGACT TCCGTGGAGG AGTGGAGTCT TTGACCACCA 
CGAGAGTCCC GTTTTAGTAC AAGGGGTGGG GGCCCCGTCT TGGGGAGAAT . CTTCGGAGAC 
TCAGGGAGAC GTCTTCTGGC CCGTCGTTTC GGAGGTAGAC CTTCAGACAG ACGGAAACAA 
GGAACTTCTT ACGTCGTAAC AGAAACAGAC AGGGGTGGTG TACCTCCACC CCCACCCTAG 

?SSaatcct tttcgttttt tcccagggtc tagggaaccg GGAAAGGAGG cttttttttt 

TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTT 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GTATCTCTAG ACACCATGTT TGCAAAGCAA GGTAACAAAT GGTACAAGC 

(2) INFORMATION FOR SEQ ID NO : 4 : 

<i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 base pairs 
tB) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



600 
660 
720 
7B0 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1717 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

GTATCTCTAG ACACCATCAA GGTTCTGTGG -TGCGTTGC TGGTCACATT CCTGGCAGGA 
TGCCAGGCCT TTGCAAAGCA AGGTAACAAA VOG 



60 
93 
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{2) INFORMATION FOR SEQ ID NO: 5: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 73 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GTACTTCTAG ACTACTTGTC ATCGTCGTCC TTGTAGTCAC CACCGAACAG GGCGCTCCCC 6 0 

GAGTTGGCAT GCC 73 

£2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 base pairs 

(B) TYPE: nucleic acid 
(CJ STRAND EDNESS : single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GTACTTCTAG AGATCTTCTT CACTGATCAG CTTCTGTTCA CCACCGAACA GGGCGCTCCC €0 
CGAGTTGGCA TGCC 74 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 620 amino acids 

(B) TYPE: amino acid 

(C) ST RAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

{xi} SEQUENCE DESCRIPTION: SEQ ID N0:7: 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

1 5 . 10 15 

Xaa Xaa Xaa Leu Xaa Xaa. Leu Leu Leu Leu Xaa Xaa Xaa Xaa Xaa Xaa 

20 25 30 

Xaa Leu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

35 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Leu Xaa Xaa Tyr Tyr Xaa Leu Xaa 

SO 55 60 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
65 70 7S 80 

Xaa Xaa Xaa Leu Xaa Xaa Met Gin Xaa Phe Phe Gly Leu Xaa Val Thr 

85 90 S3 

Gly Lys Leu Asp Xaa Xaa Thr Leu Glu Xaa Met Xaa Lys Pro Arg Cys 
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100 

Gly Val Pro Asp Xaa 
115 

Phe Xaa Leu Xaa Pro 
130 

Tyr Arg lie Xaa Asn 
145 

Xaa Ala lie Xaa Lys 
165 

Xaa Phe Xaa Xaa Val 
ISO 

Xaa Ala Asp lie Met 
195 

Xaa Pro Phe Asp Gly 
210 

Gly Pro Gly He Gly 
225 

Thr Xaa Xaa Xaa Xaa 
245 

Xaa Xaa Xaa Xaa Xaa 
260 

Xaa Xaa Xaa Xaa Xaa 
275 

Xaa Xaa Xaa Xaa Xaa 
290 

Xaa Xaa Xaa Xaa Xaa 
305 

Xaa Xaa Xaa Xaa Xaa 
325 

Xaa Xaa Xaa Xaa Xaa 
340 

Xaa Xaa Xaa Xaa Xaa 
355 

Xaa Xaa Xaa Xaa Xaa 
370 

Xaa Xaa Xaa Xaa Xaa 
385 

Xaa Xaa Xaa Xaa Xaa 
405 

Xaa Xaa Xaa Xaa Xaa 
420 

Xaa Gly His Ser Leu 
435 

MeL Tyr Pro Xaa Tyr 
450 

Xaa Xaa Asp Asp He 
465 

Xaa Xaa Xaa Xaa Xaa 
465 

Xaa Xaa Xaa Xaa Xaa 

^ 500 
Xaa Xaa Xaa Xaa Xaa 
515 



10S 

Xaa Xaa Xaa Xaa Xaa 
120 

Xaa Xaa Pro Lys Trp 
135 

Tyr Thr Pro Asp Leu 
ISO 

Ala Phe Xaa Val Trp 
170 

Xaa Xaa Xaa Xaa Xaa 
185 

He Xaa Phe Ala Xaa 
200 

Pro Gly Gly Xaa Leu 
215 

Gly Asp Ala His Phe 
230 

Xaa Xaa Xaa Xaa Xaa 
250 

Xaa Xaa Xaa Xaa Xaa 
265 

Xaa Xaa Xaa Xaa Xaa 
280 

Xaa Xaa Xaa Xaa Xaa 
295 

Xaa Xaa Xaa Xaa Xaa 
310 

Xaa Xaa Xaa Xaa Xaa 
330 

Xaa Xaa Xaa Xaa Xaa 
345 

Xaa Xaa Xaa Xaa Xaa 
3S0 

Xaa Xaa Xaa Xaa Xaa 
375 

Xaa Xaa Xaa Xaa Xaa 
390 

Xaa Xaa Xaa Xaa Xaa 
410 

Gly Xaa Asn Leu Phe 
425 

Gly. Leu Xaa His Ser 
440 

Xaa Xaa Phe Xaa Asp 
455 

Xaa Gly He Gin Xaa 
470 

Xaa Xaa Xaa Xaa Xaa 
490 

Xaa Xaa Xaa Xaa Xaa 
505 

Xaa Xaa Xaa Xaa Xaa 
520 



110 

Xaa Xaa Xaa Val Xaa Xaa 

125 

Xaa Xaa Xaa Xaa Xaa Thr 
140 

Xaa Xaa Xaa Xaa Val Asp 
155 160 
Ser Xaa Val Thr Pro Leu 
175 

Xaa Xaa Xaa Xaa Glu Gly 
190 

Xaa Glu His Gly Asp Xaa 
205 

Ala His Ala Phe Xaa Pro 
220 

Asp Asp Asp Glu Xaa Trp 
235 240 
Xaa Xaa Xaa Xaa Xaa Xaa 
255 

Xaa Xaa Xaa Xaa Xaa Xaa 
270 

Xaa Xaa Xaa Xaa Xaa Xaa 
265 

Xaa Xaa Xaa Xaa Xaa Xaa 
300 

Xaa Xaa Xaa Xaa Xaa Xaa 
315 320 
Xaa Xaa Xaa Xaa Xaa Xaa 
335 

Xaa Xaa Xaa Xaa Xaa Xaa 
350 ■ 

Xaa Xaa Xaa Xaa Xaa Xaa 
365 

Xaa Xaa Xaa Xaa Xaa Xaa 
380 

Xaa Xaa Xaa Xaa Xaa Xaa 
395 400 
Xaa Xaa Xaa Xaa Xaa Xaa 
415 

Leu Val Ala Ala His Glu 
430 

Xaa Asp Pro Xaa Ala Leu 
445 

Xaa Xaa Xaa Phe Xaa Leu 
460 

Leu Tyr Gly Xaa Xaa Xaa 
475 4B0 
Xaa Xaa Xaa Xaa Xaa Xaa 
495 

Xaa Xaa Xaa Xaa Xaa Xaa 
510 

>..-; i Xaa Xaa Xaa Xaa Xaa 
525 
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Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Pro Xaa Xaa Cys Asp Xaa Xaa Xaa 

530 535 540 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Asp Ala lie Xaa Xaa Xaa 
54S 550 555 560 

Arg Gly Glu Xaa Phe Phe Phe Lys Asp Arg Phe Phe Trp Arg Xaa Xaa 

565 S70 575 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu lie Xaa Xaa Phe 

580 5B5 590 

Trp Pro Xaa Leu Pro Xaa Xaa lie Asp Ala Ala Tyr Glu Xaa Xaa Xaa 

595 600 60S 

Xaa Xaa Xaa Val Phe Phe Phe Lys Gly Xaa Xaa Tyr Trp Xaa Tyr Xaa 

610 615 620 

Gly Xaa Xaa Xaa Xaa Xaa Gly Tyr Pro Xaa Xaa lie Xaa Xaa Xaa Leu 
62S 630 635 640 

Gly Phe Pro Xaa Xaa Val Xaa Xaa lie Asp Ala Ala Xaa Xaa Xaa Xaa 

645 650 655 

Xaa Xaa Xaa Xaa Lys Thr Tyr Phe Phe Xaa Xaa Xaa Xaa Tyr Trp Arg 

660 665 670 

Tyr Asp Glu Xaa Xaa Xaa Xaa Met Asp Pro Gly Tyr Pro Lys Xaa lie 

675 ■' 680 685 

Xaa Xaa Xaa Phe Xaa Gly lie Xaa Xaa Xaa Val Asp Ala Val Phe Xaa 

690 695 700 

Xaa Xaa Xaa Xaa Gly Phe Xaa Tyr Phe Phe Xaa Gly Xaa Xaa Xaa Tyr 
705 710 715 720 

Xaa Phe Xaa Xaa Xaa Xaa Xaa Xaa Val Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

725 730 735 

Xaa Xaa Xaa Xaa Trp Leu Xaa Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

770 775 -780 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
785 790 795 800 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305 810 815 

Xaa Xaa Xaa Xaa 
820 

(2) INFORMATION FOR SEQ ID N0:B: 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(DJ TOPOLOGY: linear 

(ii> MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:B: 

Pro Arg Cys Gly Val Thr Asp 
1 5 



Xaa 



Xaa 



740 745 
Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
755 760 



7S0 

Xaa Xaa Xaa Xaa Xaa Xaa 
765 
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(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Arg Lys Lys Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 10: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
tD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 



(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10: 



Lys 
l 


Ala 


Pro Thr 


Ser 
S 


Thr 


Arg 


Phe 


Ser 


Asp 
10 


Ala 


He 


Arg 


Ala 


Phe 
15 


Gin 


Trp 


Val 


Ser Gin 


Leu 


Pro 


val 


Ser 


Gly Val 


Leu 


Asp 


Arg 


Ala 


Asn 


Leu 




20 










25 










30 






Arg 


Gin 


Met Thr 


Arg 


Pro 


Arg 


Cys 


Gly 


val 


Thr 


Asp 


Thr 


Asn 


Ser 


Tyr 




35 








40 










45 








Ala 


Ala 
50 


Trp Ala 


Glu 


Arg 


He 
55 


Ser 


Asp 


Leu 


Phe 


Ala 
60 


Arg 


His 


Arg 


Thr 


Lys 
65 


Met 


Arg Arg 


Lys 


Lys 
70 


Arg 


Phe 


Ala 


Lys 


Gin 
75 


Gly 


Asn 


Lys 


Trp 


Tyr 
80 


Lys 


Gin 


His Leu 


Ser 
85 


Tyr 


Arg 


Leu 


val 


Asn 
90 


Trp 


Pro 


Glu 


His 


Leu 
95 


Pro 


Glu 


Pro 


Ala val Arg Gly Ala 


Val 


Arg 


Ala 


Ala 


Phe Gin Leu Trp Ser 






100 










105 










110 




Ala 


Asn 


val 


Ser Ala 


Leu 


Glu 


Phe 


Trp 


Glu 


Ala 


Pro 


Ala 


Thr Gly 


Pro 






115 








120 










125 








Asp 


lie 


Arg Leu 


Thr 


Phe 


Phe 


Gin Gly Asp 


His 


Asn 


Asp 


Gly 


Leu Gly 


130 








135 










140 










Asn 


Ala 


Phe Asp Gly 


Pro 


Gly 


Gly Ala 


Leu 


Ala 


His 


Ala 


Phe 


Leu 


Pro 


145 








150 










155 










160 


Arg 


Arg 


Gly Glu 


Ala 


His 


Phe 


Asp 


Gin Asp 


Glu 


Arg 


Trp 


Ser 


Leu 


Ser 




165 










170 










175 


Gly 


Arg 


Arg 


Arg Gly Arg 


Asn 


Leu 


Phe 


Val 


Val 


Leu 


Ala 


His 


Glu 


He 


1B0 










185 










190 






His 


Thr 


Leu Gly 
1 o e; 


Leu 


Thr 


His 


Ser 
200 


Pro 


Ala 


Pro 


Arg 


Ala 
205 


Leu 


Met 


Ala 
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Pro 


Tyr 


Tyr 


Lys 


Arg 


Leu 


Gly 


Arg 


Asp 


Ala Leu 


Leu 


Ser 


Trp 


Asp 


Asp 




210 










215 








220 










Val 


Leu 


Ala 


Val 


Gin 


Ser 


Leu 


Tyr 


Gly 


Lys Pro 


Leu 


Gly Gly Ser 


Val 


22S 










230 








235 










240 


Ala 


Val 


Gin 


Leu 


Pro 


Gly 


Lys 


Leu 


Phe 


Thr Asp 


Phe 


Glu Thr Trp 


Asp 










245 










250 








255 




Ser 


Tyr 


Ser 


Pro 


Gin 


Gly 


Arg 


Arg 


Pro 


Glu Thr 


Gin Gly 


Pro 


Lys 


Tyr 








260 










265 








270 






Cys 


Kis 


Ser 


Ser 


Phe 


Asp 


Ala 


He 


Thr 


Val Asp 


Arg 


Gin 


Gin 


Gin 


Leu 






275 










2B0 








265 








Tyr 


He 


Phe 


Lys 


Gly 


Ser 


His 


Phe 


Trp 


Glu Val 


Ala Ala Asp Gly 


Asn 




290 










295 








300 










Val 


Ser 


Glu 


Pro 


Arg 


Pro 


Leu 


Gin 


GlU 


Arg Trp 


Val 


Gly Leu 


Pro 


Pro 


305 










310 








315 










320 


ASTL 


He 


Glu 


Ala 


Ala 


Ala 


Val 


Ser 


Leu 


Asn Asp 


Gly Asp 


Phe 


Tyr 


Phe 










325 










330 








335 




Phe 


Lys 


Gly 


Gly 


Arg 


Cys 


Trp 


Arg 


Phe 


Arg Gly 


Pro 


Lys 


Pro 


Val 


Trp 








340 










345 








350 






Gly 


Leu 


Pro 


Gin 


Leu 


Cys 


Arg 


Ala 


Gly 


Gly Leu 


Pro 


Arg 


His 


Pro 


Asp 






355 










360 








365 








Ala 


Ala 


Leu 


Phe 


Phe 


Pro 


Pro 


Leu 


Arg 


Arg Leu 


He 


Leu 


Phe 


Lys 


Gly 




370 










37S 








380 










Ala 


Arg 


Tyr 


Tyr 


Val 


Leu 


Ala 


Arg 


Gly 


Gly Leu 


.Gin 


Val 


Glu 


Pro 


Tyr 


3BS 










390 








395 










400 


Tyr 


Pro 


Arg 


Ser 


Leu 


Gin 


Asp 


Trp 


Gly 


Gly He 


Pro 


Glu 


Glu 


Val 


Ser 










405 










410 








415 




Gly 


Ala 


Leu 


Pro 


Arg 


Pro 


Asp 


Gly 


Ser 


He He 


Phe 


Phe 


Arg 


ASp 


Asp 








420 










425 








430 






Arg 


Tyr 


Trp 


Arg 


Leu 


Asp 


Gin 


Ala 


Lys 


Leu Gin 


Ala 


Thr 


Thr 


Ser 


Gly 






435 










440 








445 








Arg 


Trp 


Ala 


Thr 


Glu 


Leu 


Pro 


Trp 


Met 


Gly Cys 


Trp 


His 


Ala 


Asn 


Ser 




450 










455 








460 










Gly 


Ser 


Ala 


Leu 


Phe 























465 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7" amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Pro Arg Cys Gly Val Pro Asp 
1 5 

(2) INFORMATION FCR SEQ ID NO : 12 : 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

tii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

His Glu Xaa Gly His Xaa Leu Gly Leu Xaa His Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Gly Glu Gly Pro Leu Gly Leu Tyr Ala Lys Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) -TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 
Arg His Arg Thr Lys Met Arg Arg Lys Lys Arg Phe Ala Lys Gin Gly 

1 5 10 

Asn 



(2) INFORMATION FOR SEQ ID NO: IS: 

<i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Phe Gin Gly Asp His Asn Asp Gly Leu Gly Asn Ala Phe Asp Gly 
X 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Arg Ser Leu Gin Asp Trp Gly Gly lie Pro Glu Glu Val Ser Gly Ala 

1 5 10 IS 

Leu Pro Arg Pro Asp Gly Ser He He 
20 25 

(2) INFORMATION FOR SEQ ID NO: 17: 

ti) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 19 amino acids 
(BJ TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: H : 

Ala Thr Glu Leu Pro Trp Met Gly Cys Trp His Ala Asn Ser Gly Ser 

1 5 10 15 

Ala Leu Phe 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Asn Cys Gin Gin Leu Trp Leu Gly Phe Leu Leu Pro Met Thr Val 
! 5 10 IS 
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Ser Gly 

Ser Gin 

Lys Pro 
50 

Glu -Leu 
65 

Arg Gin 

Lys Tyr 

He Leu 

Leu Arg 
130 
Gin Glu 
145 

Arg- Gin 

Leu Ala 

Asp Glu 

He Ala 
210 
Tyr Ser 
225 

Phe Lys 

Lys Lys 

Pro Thr 

Cya Ser 
290 
Tyr Ala 
305 

Gly Pro 

Leu Asp 

Lys Gly 

Phe Pro 
370 
Tyr Trp 
365 

Trp Gin 
Pro lie 
Met Ser 



Arg Val 

20 
Tyr Gly 
35 

Glu Asp 

Pro Val 

Pro Arg 

Leu Leu 
100 
Asn Leu 
115 

Gin Ala 

Val Gin 

Ser Ser 

His Ala 
180 

Phe Trp 
195 

Ala His 

Gin Ala 

Leu His 

Ser Pro 
260 
Val Pro 
275 

Ser Glu 
Phe Lys 
Leu Phe 
Ala Ala 

340 

Asp Lys 
355 

Lys Lys 

Pro Leu 

Trp Asp 

Lys Gly 
420 
Trp Gin 



Leu Gly Leu 

Tyr Leu Gin 

He Thr Glu 
55 

Ser Gly Gin 
70 

Cys Gly Leu 
85 

Leu Gly Arg 

Pro Ser Thr 

Phe Gin Asp 
135 

Ala Gly Ala 
150 

Tyr Cys Ser 
165 

Asp He Pro 

Thr Glu Gly 

Glu Val Gly 
215 

Leu Met Ala 

230 
Pro Asp Asp 
245 

Val He Arg 

Pro Val Pro 

Leu Asp Ala 
295 

Gly Asp Tyr 
310 

Arg Val Ser 
325 

Val Tyr Ser 

Val Trp Arg 

Leu Asn Arg 
375 

Asn Gin Lys 

390 
Glu Leu Ala 
405 

Leu Phe Thr 
Asp Gly Arg. 



Ala Glu 

25 
Lys Pro 

40 

Ala Leu 

Leu Asp 

Glu Asp 

Trp Arg 
105 
Leu Pro 
120 

Trp Ser 

Ala Asp 

Asn Thr 

Glu Leu 
185 
Thr Tyr 
200 

His Ala 

Pro Val 

Val Ala 

Asp Glu 
265 
Thr Glu 
280 

Met Met 

Val Trp 

Ala Leu 

Pro Arg 
345 
Tyr He 
360 

Ser Glu 

Val Phe 

Arg Thr 

Gly Val 
425 
Val Tyr 



€5 

Val Ala Pro Val 
Leu Glu 
Arg Ala 



Asp Ala 

75 
Pro Phe 
90 

Lys Lys 

Pro His 

Asn Val 

He Arg 
155 
Phe Asp 
170 

Gly Ser 

Arg Gly 

Leu Gly 

Tyr Glu 
23S 
Gly He 
250 

Glu Glu 

Pro Ser 

Leu Gly 

Thr Val 
315 
Trp Glu 
330 

Thr Gin 

Asn Phe 

Pro Asn 

Leu Phe 
39S 
Asp Phe 
410 

Pro Asn 
Phe Phe 



Gly Ser 

45 
Phe Gin 
€0 

Thr Arg 



Asn Gin 

His Leu 

Thr Ala 
125 
Ala Pro 
140 

Leu Ser 

Gly Pro 

Val His 

Val Asn 
205 
Leu Gly 
220 

Gly Tyr 

Gin Ala 

Glu Glu 

Pro Met 
285 
Pro Arg 
300 

Ser Asp 

Gly Leu 

Trp He 

Lys Met 
365 
Leu Asp 
360 

Lys Gly 
Ser Ser 
Gin Pro 
Lys Gly 



Asp Tyr Leu 
30 

Asn Asn Phe 

Glu Ala Ser 

Ala Arg Met 
80 

Lys Thr Leu 
9S 

Thr Phe Arg 
110 

Arg Ala Ala 

Leu Thr Phe 

Phe His Gly 
160 

Gly Arg Val 

175 
Phe Asp Glu 
190 

Leu Arg He 

His Ser Arg 

Arg Pro Hie 
240 

Leu Tyr Gly 

255 
Thr Glu Leu 
270 

Pro Asp Pro 

Gly Lys Thr 

Ser Gly Pro 
320 

Pro Gly Asn 

335 
His Phe Phe 
350 

Ser Pro Gly 

Ala Ala Leu 

Ser Gly Tyr 
400 

Tyr Pro Lys 

415 
Ser Ala Ala 
430 

Lys Val Tyr 
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xrp « g III Asn Gin Gin Leu Arg val ol» Lys Gly Tyr Pro Arg Asn 

XI. Ser His Asn Trp Met "is Cys Arg Pro Arg Thr He Asp Thr Thr 

Pro Ser Gly Gly Asn Thr Thr Pro Ser Gly Thr Gly lie Thr Leu Asp 
485 490 

Thr Thr Leu Ser Ala Thr Glu Thr Thr Phe Glu Tyr 



500 



SOS 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 660 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

net Clu Ala Leu Met Ala Arg Gly Ala Leu Thr Gly Pro Leu Arg Ala 

Leu Cys Leu Leu Gly Cys Leu Leu Ser His Ala Ala Ala Ala Pro Ser 

Pro lie He Lys Phe Pro Gly Asp Val Ala Pro Lys Thr Asp Lys Glu 

Leu Ala Hi Gin Tyr .Leu Asn Thr Phe Tyr Gly Cys Pro Lys Glu Ser 

Cys Asn Leu Phe Val Leu Lys Asp Thr Leu Lys Lys Met Gin Lys Phe 

Phe Gly Leu Pro Gin Thr Gly Asp Leu Asp Gin Asn Thr lie Glu Thr 

„et Arg Lys Pro Arg Cys Gly Asn Pro As P v.l Ala Asn Tyr Asn Phe 

100 105 
Phe Pro Ar 3 Lys Pro Lys Trp Asp Lys Asn Gin lie Thr Tyr Arg lie 

115 I 20 

lie Gly Tyr Thr Pro Asp Leu Asp Pro Glu Thr v.l Asp Asp Ala Phe 
Ala III Ala Phe Gin Val \H Ser Asp Val Thr Pro Leu Arg Phe Ser 
Arg lie His Asp Gly Glu Ala Asp He Met lie Asn Phe Gly Arg Trp 
Glu His Gly Asp Oil Tyr Pro Phe Asp Gly Lys Asp Gly Leu Leu Ala 
His Ala Phe HI Pro Gly Thr Gly val Gly Gly Asp Ser His Phe Asp 
Asp Asp oil Leu Trp Thr Leu Gly Glu Gly G!n val val Arg Val Lys 
Tyr III Asn Ala Asp Gly llu Tyr Cys Lys *h. Pro Ph, ",eu Phe Asn 
Gly Lys Glu Tyr Asn III cys Thr Asp Thr Gly Arg Ser Asp Gly Phe 
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67 



250 255 



Leu Trp Cys Ser Thr Thr Tyr Asn Phe gIu Lys Asp Gly Lys Tyr Gly 
Phe Cys Pro His Glu Ala Leu Phe Thr Met Gly Gly Asn Ala Glu Gly 
Gin Pre III Lys Phe Pro Phe a" Phe Gin Gly Thr Ser Tyr Asp Ser 
Cys Thr Thr Glu Gly Ar 9 Thr Asp Gly Tyr Arg Trp Cys Gly Thr Thr 



G °iu Asp Tyr Asp Arg Asp Lys Lys Tyr Gly Phe Cys Pro Glu Thr Ala 



H et ser Thr Val Gly Gly Asn Ser Glu Gly AU Pro Cys Val Phe Pro 
Phe Thr Phe Leu Gly Asn Lys Tyr Glu ser Cys Thr Ser AU Gly Arg 
ser Asp Gly Lys Me. Trp Cys AU Thr Thr Ala Asn Tyr Asp Asp Asp 
Arg Lys Trp =1, Phe Cys III Asp Gin Gly Tyr Ser Leu Phe Leu Val 
Ma Ala His Glu Phe g" His Ala Met Gly Leu Glu His Ser Gin Asp 
Pro Gly Ala Leu Met Ala Pro lie Tyr Tnr Tyr Thr Lys Asn Phe Arg 
Leu Ser Gin Z Asp lie Lys Gly Ue Gin G!u Leu Tyr Gly Ala Ser 
Pro Asp III Asp Leu Gly Thr "y Pro Thr Pro Thr Leu Gly Pro Val 
Th r Pro Glu He cys Lys «» Asp He Val Phe Asp Gly He Ala Gin 
\[l Ar 3 Gly Glu lie Z Phe Phe Lys Asp Arg Phe lie Trp Arg Thr 
val Thr Pro Arg Asp Lys Pro Met Gly Pro Leu Leu val Ala Thr Phe 



330 



50S 



Trp Pro Glu Z Pro Glu Lys II. Asp Ala Val Tyr Glu Ala Pro Gin 

520 5Jb 



Glu GXu III Ala val Phe Phe Ala Gly Asn Glu Tyr Trp Ue Tyr Ser 

Ala III Thr Leu Glu Arg "ly Tyr Pro Lys Pro Leu Thr Ser Leu Gly 

Leu Pro Pro Asp Val Gin Arg Val Asp Ala Ala Phe Asn Trp Ser Lys 
S€5 "0 575 



Asn Lys Lys Thr .Tyr lie Phe Ala Gly Asp Lys Phe Trp Arg Tyr Asn 



Glu val Lys Zl Lys Met Asp Pro Cl'y Phe Pro Lys Leu lie Ala Asp 
Ala Trp III Ala He Pro Asp III Leu Asp Ala val Val -Asp Leu Gin 
Gly Gly Gly His Ser Tyr Pne Ph. Lys Gly Ala Tyr Tyr Leu Lys Leu 

Glu Asn Gin ser Leu "I Ser val Lys Phe Gly Ser lie Lys Ser Asp 

645 

Trp Leu Gly Cys 
660 



650 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 707 amino acids 
{B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 2 0 : 

„ec Ser Leu Trp Gin Pro Leu Val Leu Val Leu Leu Val Leu Gly Cys 

Cys Phe Ala Ala Pro Arg Gin Arg Gin Ser Thr Leu Val Leu Phe Pro 

Cly Asp Leu Arg Thr Asn Leu Thr Asp Arg Gin Leu Ala Glu Glu Tyr 

Leu Tyr Arg Tyr Gly Tyr Thr Arg Val Ala Glu Met Arg Cly Glu Ser 

Lys Ser Leu Gly Pro Ala Leu Leu Leu Leu Gin Lys Gin Leu Ser Leu 

70 75 
Pro Glu Thr Gly Glu Leu Asp S.. Ala Thr Leu Lys Ala Met Arg Thr 

85 90 
Pro Arg Cys Gly Val Pro Asp Leu Gly Arg Phe Gin Thr Phe Glu Gly 

Asp Leu Lys Trp His His His Asn "e Thr Tyr Trp lie Gin Asn Tyr 

Ser Glu Asp Leu Pro Arg Ala III lie Asp Asp Ala Phe Ala Arg Ala 

Phe "la Leu- Trp Ser Ala III Thr Pro Leu Thr Pne Thr Arg val Tyr 

ser Arg Asp Ala Asp \ll Val He Gin Phe Gly Val Ala Glu His Gly 

Asp Gly Tyr Pro p" Asp Gly Lys Asp Gly Leu Leu Ala His Ala Phe 

180 185 
Pro Pro Gly Pro Gly He Gin Gly Asp Ala His Phe Asp Asp Asp Glu 
200 205 



Leu Trp III Leu Gly Lys Gly val Val val Pro Thr Arg Phe Gly Asn 



Ala Asp Gly Ala Ala Cys "is Phe Pro Phe lie Phe Glu Gly Arg Ser 

Tyr Ser Ala Cys Thr III Asp Gly Arg Ser Asp Gly Leu Pro Trp Cys 

ser Thr Thr Ala Asn Tyr Asp Thr Asp Asp Arg Phe Gly Phe Cys Pro 

ser Glu Arg Leu Tyr Thr Arg Asp "y Asn Ala As P Gly Lys Pro Cys 

Gin Phe Pro Phe lie Phe Gin My Gin Ser Tyr Ser Ala Cys Thr Thr 

,00 295 300 

Asp "y A.-, ser Asp Gly Tyr Arg Trp Cys Ala Thr Thr Ala Asn Tyr 

asp Arg A.p Lys Leu 111 Gly Phe Cys Pro Thr Arg Ala Asp Ser Thr 
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V.1-H.L Cly Gly 1" Ser Ala Gly clu Uu Cys Val Phe Pro Ph. Thr 
Phe Leu oly Lys Olu Tyr Ser Thr Cys Thr Ser clu Oly Arg Gly Asp 
Oly Arg Leu Trp Cys Ala Thr Thr Ser Asn Phe Asp Ser Asp Lys Lys 

375 jow 

Trp oly Phe Cys Pro Asp Ola Oly Tyr Ser Leu Phe Leu val Ala Ala 
£ Olu Phe Oly His Ala Leu Oly Leu Asp His Ser Ser Val Pro Clu 



Ala Leu H.t Tyr Pro Me, Tyr Arg Phe Thr Olu Oly Pro Pro Leu His 
Lys Asp Asp Z A» Oly He Arg His Leu Tyr Oly Pro Arg Pro Olu 



Pro Olu Pro Arg Pro Pro Thr Thr Thr Thr Pro Gin Pro Thr Ala Pro 

450 4S5 



Pro Thr val Cys Pro Thr cly Pro Pro Thr Val His Pro Ser Clu Arg 
Pro Thr Ala Oly Pro Thr Oly Pro Pro Ser Ala Oly Pro Thr Oly Pro 
Pro Thr Ala Oly Pro Ser Thr Ala Thr Thr val Pro Leu s.r Pro V.1 
Asp Asp Ala ": Asn val Asn lie Phe Asp Ala He Ala Clu He Oly 



520 525 
Asn Oln Leu Tyr Leu Phe Lys Asp Oly Lys Tyr Trp Arg Phe Ser Olu 

Gly A r 3 Oly Ser Arg Pro "in Oly Pro Phe Leu He Ala Asp Lys Trp 

550 333 

III Ma Leu Pro Arg Lys Leu Asp Ser Val Phe Olu Olu Pro Leu Ser 
Lys Lys Leu Phe p" Phe Ser Oly Arg Tin' Val Tr P Val Tyr Thr Oly 



Al a Ser Val Leu Oly Pro Arg Arg Leu Asp Lys Leu Oly Leu Cly Ala 
Asp val III Oln val Thr Oly "a Leu Arg Ser Oly Arg Cly Lys Me, 
Leu Leu Phe Ser Oly Arg Arg Leu Trp Arg Phe Asp Val Lys Ala Oln 
„" val Asp Pro Arg Ser Ala Ser Clu Val Asp Arg Met Phe Pro Oly 

val Pro Leu Asp Thr His Asp Val Phe III Tyr Arg Clu Lys Ala Tyr 

660 



645 

His Asp Val 

Phe Cys Oln Asp Arg Phe Tyr Trp a" Val Ser Ser Arg Ser Olu Leu 
Asn Oln III Asp Oln Val oly Tyr Val Thr Tyr Asp He Leu oln Cys 
690 «9S 700 

Pro Glu Asp 
70S 



(2) INFORMATION FOR SEQ ID NO: 21: 



<i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 4 77 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Lys Ser Leu Pro He Leu Leu Leu Leu Cys Val Ala val Cys Ser 

1 5 10 15 

Ala Tyr Pro Leu Asp Gly Ala Ala Arg Gly Glu Asp Thr Ser Met Aan 

20 25 30 

Leu Val Gin Lys Tyr Leu Glu Asn Tyr Tyr Asp Leu Lys Lys Asp Val 

35 40 45 

Lys Gin Phe Val Arg Arg Lys Asp Ser Gly Pro Val Val Lys Lys He 

50 55 60 

Arg Glu Met Gin Lys Phe Leu Gly Leu Glu Val Thr Gly Lys Leu Asp 
65 70 75 80 

Ser Asp Thr Leu Glu Val Met Arg Lys Pro Arg Cys Gly Val Pro Asp 

8S 90 95 

Val Gly His Phe Arg Thr Phe Pro Gly He Pro Lys' Trp Arg Lys Thr 

100 105 110 

His Leu Thr Tyr Arg He Val Asn Tyr Thr Pro Asp Leu Pro Lya Asp 

115 120 125 

Ala Val Asp Ser Ala Val Glu Lys Ala Leu Lys Val Trp Glu Glu Val 

130 135 140 

Thr Pro Leu Thr Phe Ser Arg Leu Tyr Glu Gly Glu Ala Asp He Met 
145 150 155 160 

He Ser Phe Ala Val Arg Glu His Gly Asp Phe Tyr Pro Phe Asp Gly 

165 170 175 

Pro Gly Asn Val Leu Ala His Ala Tyr Ala Pro Gly Pro Gly He Asn 

180 185 190 

Gly Asp Ala His Phe Asp Asp Asp Glu Gin Trp Thr Lys Asp Thr Thr 

195 200 205 

Gly Thr Asn Leu Phe Leu Val Ala Ala His Glu He Gly His Ser Leu 

210 215 220 

Gly Leu Phe His Ser Ala Asn Thr Glu Ala Leu Met Tyr Pro Leu Tyr 
225 230 235 240 

His Ser Leu Thr Asp Leu Thr Arg Phe Arg Leu Ser Gin Asp Asp He 

245 250 255 

Asn Gly He Gin Ser Leu. Tyr Gly Pro Pro Pro Asp Ser Pro Glu Thr 

260 26S 270 

Pro Leu Val Pro Thr Glu Pro Val Pro Pro Glu Pro Gly Thr Pro Ala 

275 280 285 

Asn Cys Asp Pro Ala Leu Ser Phe Asp Ala Val Ser Thr Leu Arg Gly 

290 295 300 

Glu lie Leu He Phe Lys Asp Arg His Phe Trp Arg Lys Ser Leu Arg 
305 310 315 320 

Lys Leu Glu Pro Glu Leu His Leu He Ser Ser Phe Trp Pro Ser Leu 

325 330 335 

Pro Ser Gly Val Asp Ala Ala Tyr Glu Val Thr Ser Lys Asp Leu Val 
340 345 350 



WO 98/40475 



71 



PCT/US98/04694 



Phe He Phe Lys Gly Asn Gin Phe Trp Ala He Arg Gly Asn Glu Val 

355 360 365 

Arg Ala Gly Tyr Pro Arg Gly He His Thr Leu Gly Phe Pro Pro Thr 

370 375 38° 

Val Arg Lys He Asp Ala Ala He Ser Asp Lys Glu Lys Asn Lys Thr 
385 -390 395 400 

Tyr Phe Phe Val Glu Asp Lys Tyr Trp Arg Phe Asp Glu Lys Arg Asn 

40S 415 ^ 

Ser Met Glu Pro Gly Phe Pro Lys Gin He Ala Glu Asp Phe Pro Gly 

420 *25 *30 

He Asp Ser Lys He Asp Ala Val Phe Glu Glu Phe Gly Phe Phe Tyr 

43S 440 
Phe Phe Thr Gly Ser Ser Gin Leu Glu Phe Asp Pro Asn Ala Lys Lys 

450 455 460 „ 

Val Thr His Thr Leu Lys Ser Asn Ser Trp Leu Asn Cys 
465 «0 475 

(2) INFORMATION FOR SEQ ID NO; 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 76 amino acids 

(B) TYPE: amino acid 

tC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: None 

{Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 22: 

Met Met His Leu Ala Phe Leu Val Leu Leu Cys Leu Pro Val Cys Ser 

5 10 15 

Ala Tyr Pro Leu Ser Gly Ala Ala Lys Glu Glu Asp Ser Asn Lys Asp 

20 25 30 

Leu Ala Gin Gin Tyr Leu Glu Lys Tyr Tyr Asn Leu Glu Lys Asp Val 

Lys Gin Phe Arg Arg Lys Asp Ser Asn Leu He Val Lys Lys He Gin 



50 55 60 



Gly Met Gin Lys Phe Leu Gly Leu Glu Val Thr Gly Lys Leu Asp Thr 

Asp Thr Leu Glu Val Met Arg Lys Pro Arg Cys Gly Val Pro Asp Val 

BS 9° 95 

Gly His Phe Ser Ser Phe Pro Gly Met Pro Lys Trp Arg Lys Thr His 

100 HO 

Leu Thr Tyr Arg He Val Asn Tyr Thr Pro Asp Leu Pro Arg Asp Ala 

115 120 125 

Val Asp Ser Ala lie Glu Lys Ala Leu Lys Val Trp Glu Glu Val Thr 

Pro Leu Thr Phe Ser Arg Leu Tyr Glu Gly Glu Ala Asp He Met He 
145 150 155 160 

Ser Phe Ala Val Lys Glu His Gly Asp Phe Tyr Ser Phe Asp Gly Pro 

165 "0 175 ^ 

Gly His Ser Leu Ala His Ala Tyr Pro Pro Gly Pro Gly Leu Tyr Gly 
180 185 i9 ° 
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Asp lie His Phe Asp Asp Asp Glu Lys Tr P Thr Clu Asp Ala Ser Gly 

195 200 205 

Th^ Asn Leu Phe Leu Val Ala Ala His Glu Leu Gly His Ser Leu Gly 

210 215 220 

Leu Phe His Ser Ala Asn Thr Glu Ala Leu Met Tyr Pro Leu Tyr Asn 

215 



III Phe Thr Glu Leu Gin Phe Are, Leu IVr Gin Asp Asp Val Asn 



Gly lie Gin Ser Leu Tyr Gly Pro Pro Pro Ala Ser Thr Glu Glu Pro 
L eu val Pro Thr Lys Ser Val Pro Ser Gly Ser Glu Met Pro Ala Lye 



2S0 



280 2BS 



Cys Asp Pro Ala Leu Ser Phe Asp Ala lie Ser Thr Leu Arg Gly Glu 
Tyr III Phe Phe Lys As P a" Tyr Phe Trp Arg Are Ser His Trp Asn 



III Glu Pro Glu Phe "is Leu He Ser Ala Phe Trp. Pro ser Leu Pro 



330 



ser Tyr Leu Asp Ala Ala Tyr Glu Val Asn Ser Arg Asp Thr Val Phe 

He Phe Lys g" Asn Glu Phe Trp m' H« ^g Gly Asn Glu Val Gin 

- 360 36d 

Ala Gly Tyr Pro Arg Gly lie His Thr Leu Gly Phe Pro Pro Thr lie 

T70 3" 380 

Arg Lys He Asp Ala Ala Val Ser Asp Lys Glu Lys Lys Lys Thr Tyr 

Phe Phe Ala Ala Asp Lys Tyr Trp Arg Phe Asp Glu Asn Ser Gin Ser, 

Met Glu Gin Gly Phe Pro Arg Leu lie Ala Asp Asp Phe Pro Gly Val 

420 4 25 0 0 

Glu Pro Lys Val Asp Ala Val Leu Gin Ala Phe Gly Phe Phe Tyr Phe 

435 440 
Phe Ser Gly Ser Ser Gin Phe Glu Phe Asp Pro Asn Ala Arg Met Val 

450 4 5 5 460 

Thr His lie Leu Lys Ser Asn Ser Trp Leu His Cys 

465 4 ™ 475 

(2) INFORMATION FOR SEQ ID NO: 23: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : None 

(Xi> SEQUENCE DESCRIPTION : SEQ ID NO-.23: 

Met His Ser Phe Pro Pro Leu Leu Leu Leu Leu Phe Trp Gly Val Val 

Ser His Ser Phe Pro Ala Thr Leu Glu Thr Gin Glu .Gin Asp Val Asp 

25 



20 
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Leu Val Gin Lys Tyr Leu Glu Lys Tyr Tyr Asn Leu Lys Asn Asp Gly 

35 40 45 

Arg Gin Val Glu Lya Arg Arg Asn Ser Gly Pro Val Val Glu Lys Leu 

50 55 60 

Lys Gin Met Gin Glu Phe Phe Gly Leu Lya Val Thr Gly Lys Pro Asp 
65 70 75 80 

Ala Glu Thr Leu Lya Val Met Lys Gin Pro Arg Cys Gly Val Pro Asp 

85 . 90 95 

Val Ala Gin Phe Val Leu Thr Glu Gly Asn Pro Arg Trp Glu Gin Thr 

100 105 110 

His Leu Thr Tyr Arg lie Glu Asn Tyr Thr Pro Asp Leu Pro Arg Ala 

115 120 . 125 

Asp Val Asp His Ala lie Glu Lys Ala Phe Gin Leu Trp Ser Asn Val 

130 135 140 

Thr Pro Leu Thr Phe Thr Lys Val Ser Glu Gly Gin Ala Asp lie Met 
145 150 155 160 

lie Ser Phe Val Arg Gly Asp His Arg Asp Asn Ser Pro Phe Asp Gly 

165 170 17S 

Pro Gly Gly Asn Leu Ala His Ala Phe Gin Pro Gly Pro Gly lie Gly 

160 185 190 

Gly Asp Ala His Phe Asp Glu Asp Glu Arg Trp Thr Asn Asn Phe Arg 

195 200 20S 

Glu Tyr Asn Leu His Arg Val Ala Ala His Glu Leu Gly His Ser Leu 

210 215 220 

Gly Leu Ser His Ser Thr Asp lie Gly Ala Leu Met Tyr Pro Ser Tyr 
225 230 235 240 

Thr Phe Ser Gly Asp Val Gin Leu Ala Gin Asp Asp He Asp Gly He 

245 250 255 

Gin Ala He Tyr Gly Arg Ser Gin Asn Pro Val Gin Pro He Gly Pro 

260 265 270 

Gin Thr Pro Lys Ala Cys Asp Ser Lys Leu Thr Phe Asp Ala He Thr 

275 . 2B0 285 

Thr He Arg Gly Glu Val Met Phe Phe Lys Asp Arg Phe Tyr Met Arg 

290 295 300 

Thr Asn Pro Phe Tyr Pro Glu Val Glu Leu Asn Phe He Ser Val Phe 
305 310 315 320 

Trp Pro Gin Leu Pro Asn Gly Leu Glu Ala Ala Tyr Glu Phe Ala Asp 

325 330 335 

Arg Asp Glu Val Arg Phe Phe Lys Gly Asn Lys Tyr Trp Ala Val Gin 

340 345 350 

Gly Gin Asn Val Leu His Gly Tyr Pro Lys Asp He Tyr Ser Ser Phe 

355 360 365 

Gly Phe Pro Arg Thr Val. Lys His He Asp Ala Ala Leu Ser Glu Glu 

370 375 380 

Asn Thr Gly Lys Thr Tyr Phe Phe Val Ala Asn Lys Tyr Trp Arg" Tyr 
365 390 395 400 

Asp Glu Tyr Lys Arg Ser Met Asp Pro Gly Tyr Pro Lys Met He Ala 

40S 410 415 

His Asp Phe Pro Gly He Gly His Lys Val Asp Ala Val Phe Met Lys 

420 425 430 

Asp Gly Phe Phe Tyr Phe Phe His Gly Thr Arg Gin Tyr Lys Phe Asp 

435 440 445 

Pro Lys Thr Lys Arg He Leu Thr Leu Gin Lya Ala Asn Ser Trp Phe 
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450 455 460 

Asn Cys Arg Lys Asn 
465 

(2) INFORMATION FOR SEQ ID NO; 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : None 

Ui) SEQUENCE DESCRIPTION : SEQ ID NO:24: 

Met Phe Ser Leu Lya Thr Leu Pro Phe Leu Leu Leu. Leu His Val Gin 

15 10 15 

lie Ser Lys Ala Phe Pro Val Ser Ser Lya Glu Lys Asn Thr Lys Thr 

20 25 30 

Val Gin Asp Tyr Leu Glu Lys Phe Tyr Gin Leu Pro Ser Asn Gin Tyr 

35 40 45 

Gin Ser Thr Arg Lys Asn Gly Thr Asn Val He Val Glu Lys Leu Lys 

SO 55 60 

Glu Met Gin Arg Phe Phe Gly Leu Asn Val Thr Gly Lys Pro Asn Glu 
65 .70 75 80 

Glu Thr Leu Asp Met Met Lys Lys Pro Arg Cys Gly Val Pro Asp Ser 

B5 30 95 

Gly Gly Phe Met Leu Thr Pro Gly Asn Pro Lys Trp Glu Arg Thr Asn 

100 105 110 

Leu Thr Tyr Arg He Arg Asn Tyr Thr Pro Gin Leu Ser Glu Ala Glu 

115 120 125 

Val Glu Arg Ala He Lys Asp Ala Phe Glu Leu Trp Ser Val Ala Ser 

130 135 140 

Pro Leu He Phe Thr Arg He Ser Gin Gly Glu Ala Asp He Asn He 
145 150 155 160 ' 

Ala Phe Tyr Gin Arg Asp His Gly Asp Asn Ser Pro Phe Asp Gly Pro 

165 170 175 

Asn Gly He Leu Ala His Ala Phe Gin Pro Gly Gin Gly He Gly Gly 

180 185 190 

Asp Ala His Phe Asp Ala Glu Glu Thr Trp Thr Asn Thr Ser Ala Asn 

195' 200 205 

Tyr Asn Leu Phe Leu Val Ala Ala His Glu Phe Gly His Ser Leu Gly 

210 215 220 

Leu Ala His Ser Ser Asp Pro Gly Ala Leu Met Tyr Pro Asn Tyr Ala 
225 230 235 240 

Phe Arg Glu Thr Ser Asn Tyr Ser Leu Pro Gin Asp Asp He Asp Gly 

245 250 255 

He Gin Ala He Tyr Gly Leu Ser Ser Asn Pro He Gin Pro Thr Gly 

260 265 270 

Pro Ser Thr Pro Lys Pro Cys Asp Pro Ser Leu Thr Phe Asp Ala He 

275 280 285 

Thr Thr Leu Arg Gly Glu lie Leu Phe Phe Lys Asp Arg Tyr Phe Trp 
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290 295 300 

Arg Arg His Pro Gin Leu Gin Arg Val Glu Met Asn Phe lie Ser Leu 
305 310 315 320 

Phe Trp Pro Ser Leu Pro Thr Gly He Gin Ala Ala Tyr Glu Asp Phe 

325 330 335 

Asp Arg Asp Leu lie Phe Leu Phe Lys Gly Asn Gin Tyr Trp Ala Leu 

340 345 3S0 

Ser Gly Tyr Asp He Leu Gin Gly Tyr Pro Lys Asp He Ser Asn Tyr 

355 360 355 

Gly Phe Pro Ser Ser Val Gin Ala He Asp Ala Ala Val Phe Tyr Arg 

370 375 380 

Ser Lys Thr Tyr Phe Phe Val Asn Asp Gin Phe Trp Arg Tyr Asp Asn 
385 390 395 ^ 400 

Gin Arg Gin Phe Met Glu Pro Gly Tyr Pro Lys Ser He Ser Gly Ala 

405 410 415 

Phe Pro Gly He Glu Ser Lys Val Asp Ala Val Phe Gin Gin Glu His 

420 425 430 

Phe Phe His Val Phe Ser Gly Pro Arg Tyr Tyr Ala Phe Asp Leu He 

435 440 445 

Ala Gin Arg Val Thr Arg Val Ala Arg Gly Asn Lys Trp Leu Asn Cys 

450 4SS 460 

Arg Tyr Gly 
465 

(2) INFORMATION FOR SEQ ID NO: 25: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 471 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met His Pro Gly Val Leu Ala Ala Phe Leu Phe Leu Ser Trp Thr His 

15 10 15 

Cys Arg Ala Leu Pro Leu Pro Ser Gly Gly Asp Glu Asp Asp Leu Ser 

20 25 30 

Glu Glu Asp Leu Gin Phe Ala Glu Arg Tyr Leu Arg Ser Tyr Tyr His 

35 40 45 

Pro Thr Asn Leu Ala Gly He Leu Lys Glu Asn Ala Ala Ser Ser Met 

50 55 60 

Thr Glu Arg Leu Arg Glu Met Gin Ser Phe Phe Gly Leu Glu Val Thr 
65 70 75 80 

Gly Lys Leu Asp Asp Asn Thr Leu Asp Val Met Lys Lys Pro Arg Cys 

B5 90 95 

Gly Val Pro Asp Val Gly Glu Tyr Asn Val Phe Pro Arg Thr Leu Lys 

100 , 105 HO 

Trp Ser Lys Met Asn Leu Thr Tyr Arg He Val Asn Tyr Thr Pro Asp 

115 120 13 5 

Met Thr His Ser Glu Val Glu Lys Ala Phe Lys Lys Ala Phe Lys Val 
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130 135 140 

Trp Ser Asp Val Thr Pro Leu Asn Phe Thr Arg Leu His Asp Gly He 
145 150 155 160 

Ala Asp He Met lie Ser Phe Gly He Lys Glu His Gly Asp Phe Tyr 

165 170 175 

Pro Phe Asp Gly Pro Ser Gly Leu Leu Ala His Ala Phe Pro Pro Gly 

180 IBS 190 

Pro Asn Tyr Gly Gly Asp Ala His Phe Asp Asp Asp Glu Thr Trp Thr 

195 200 205 

Ser Ser Ser Lys Gly Tyr Asn Leu Phe Leu Val Ala Ala His Glu Phe 

210 215 220 

Gly His Ser Leu Gly Leu Asp His Ser Lys Asp Pro Gly Ala Leu Met 
225 230 235 240 

Phe Pro He Tyr Thr Tyr Thr Gly Lys Ser His Phe Met Leu Pro Asp 

245 250 255 

Asp Asp Val Gin Gly He Gin Ser Leu Tyr Gly Pro Gly Asp Glu Asp 

260 265 270 

Pro Asn Pro Lys His Pro Lys Thr Pro Asp Lys Cys Asp Pro Ser Leu 

275 280 285 

Ser Leu Asp Ala He Thr Ser Leu Arg Gly Glu Thr Met He Phe Lys 

290 295 300 

Asp Arg Phe Phe Trp Arg Leu His Pro Gin Gin Val Asp Ala Glu Leu 
305 310 315 320 

Phe Leu Thr Lys Ser Phe Trp Pro Glu Leu Pro Asn Arg He Asp Ala 

325 330 335 

Ala Tyr Glu His Pro Ser His Asp Leu He Phe He Phe Arg Gly Arg 

340 34S 350 

Lys Phe Trp Ala Leu Asn Gly Tyr Asp He Leu Glu Gly Tyr Pro Lys 

355 360 365 

Lys He Ser Glu Leu Gly Leu Pro Lys Glu Val Lys Lys He Ser Ala 

370 375 380. 

Ala Val His Phe Glu Asp Thr Gly Lys Thr Leu Leu Phe Ser Gly Asn 
385 390 395 400 

Gin Val Trp Arg Tyr Asp Asp Thr Asn His He Met Asp Lys Asp Tyr 

405 410 415 

Pro Arg Leu He Glu Glu Asp Phe Pro Gly He Gly Asp Lys Val Asp 

420 425 430 

Ala Val Tyr Glu Lys Asn Gly Tyr He Tyr Phe Phe Asn Gly Pro He 

435 440 445 

Gin Phe Glu Tyr Ser He Trp Ser Asn Arg He Val Arg Val Met Pro 

450 455 460 

Ala Asn Ser He Leu Trp Cys 
4 65 470 

(2) INFORMATION FOR SEQ ID NO: 26: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 amino acids 

( B ) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(iij MOLECULE TYPE: None 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Met Lys Phe Leu Leu lie Leu Leu Leu Gin Ala Thr Ala Ser Gly Ala 

IS io is 

Leu Pro Leu Asn Ser Ser Thr Ser Leu Glu Lys Asn Asn Val Leu Phe 

20 25 30 

Gly Glu Arg Tyr Leu Glu Lys Phe Tyr Gly Leu Glu lie Asn Lys Leu 

35 40 45 

Pro Val Thr Lys Met Lys Tyr Ser Gly Asn Leu Met Lys Glu Lys lie 

50 55 60 

Gin Glu Met Gin His Phe Leu Gly Leu Lys Val Thr Gly Gin Leu Asp 
65 70 75 SO 

Thr Ser Thr Leu Glu Met Met His Ala Pro Arg Cys Gly Val Pro Asp 

6S 90 95 

Val His His Phe Arg Glu Met Pro Gly Gly Pro Val Trp Arg Lys His 

100 105 110 

Tyr lie Thr Tyr Arg* lie Asn Asn Tyr Thr Pro Asp Met Asn Arg Glu 

115 120 12S 

Asp Val Asp Tyr Ala He Arg Lys Ala Phe Gin Val Trp Ser Asn Val 

130 135 140 

Thr Pro Leu Lys Phe Ser Lys He Asn Thr Gly Met Ala Asp He Leu 
145 150 155 160 

Val Val Phe Ala Arg Gly Ala His Gly Asp Phe His Ala Phe Asp Gly 

165 170 175 

Lys Gly Gly He Leu Ala His Ala Phe Gly Pro Gly Ser Gly He Gly 

180 185 190 

Gly Asp Ala His Phe Asp Glu Asp Glu Phe Trp Thr Thr His Ser Gly 

195 200 205 

Gly Thr Asn Leu Phe Leu Thr Ala Val His Glu He Gly His Ser Leu 

210 215 220 

Gly Leu Gly His Ser Ser Asp Pro Lys Ala Val Met Phe Pro Thr Tyr 
225 230 235 240 

Lys Tyr Val Asp He Asn Thr Phe Arg Leu Ser Ala Asp Asp He Arg 

245 250 255 

Gly He Gin Ser Leu Tyr Gly Asp Pro Lys Glu Asn Gin Arg Leu Pro 

260 265 270 

Asn Pro Asp Asn Ser Glu Pro Ala Leu Cys Asp Pro Asn Leu Ser Phe 

275 280 285 

Asp Ala Val Thr Thr Val Gly Asn Lys He Phe Phe Phe Lys Asp Arg 

290 295 300 

Phe Phe Trp Leu Lys Val Ser Glu Arg Pro Lys Thr Ser Val Asn Leu 
305 310 315 320 

He Ser Ser Leu Trp Pro Thr Leu Pro Ser Gly He Glu Ala Ala Tyr 

325 330 335 

Glu He Glu Ala Arg Asn Gin Val Phe Leu Phe Lys Asp Asp Lys Tyr 

340 345 350 

Trp Leu He Ser Asn Leu Arg Pro Glu Pro Asn Tyr Pro Lys Ser He 

355 360 365 

His Ser Phe Gly Phe Pro Asn Phe Val Lys Lys He Asp Ala Ala Val 

370 375 380 

Phe Asn Pro Arg Phe Tyr Arg Thr Tyr Phe Phe Val Asp Asn Gin Tyr 
385 390 395 400 

Trp Arg Tyr Asp Glu Arg Arg Gin Met Met Asp Pro Gly Tyr Pro Lys 
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Leu lie Thr Lys Asn Phe Gin Gly 
420 

Phe Tyr Ser Lys Asn Lys Tyr Tyr 
435 440 
Phe Glu Tyr Asp Phe Leu Leu Gin 

450 455 
Asn Ser Trp Phe Gly Cys 
465 470 



410 415 
lie Gly Pro Lys lie Asp Ala Val 
425 430 
Tyr Phe Phe Gin Gly Ser Asn Gin 
445 

Arg lie Thr Lys Thr Leu Lys Ser 
460 



(2) INFORMATION FOR SEQ "ID NO:27; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE • TYPE : None 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 27: 



Met 
1 

Ala 

Glu 

Thr 

Phe 

£5 

He 

Leu 

He 

Val 

Arg 
145 
Gly 

Ala 

Asp 

Leu 

Ser 
225 
Pro 



Arg Leu 

Leu Pro 

Gin Ala 

35 
Lys Asn 
50 

Phe Gly 

Met Gin 

Phe Pro 

Val Ser 
115 
Ser Lys 
130 

Lys Val 

Ala His 

Kis Ala 

Glu Asp 
195 
Tyr Ala 
210 

Ser Asp 



Thr Val 
5 

Leu Pro 
20 

Gin Asp 

Ala Asn 

Leu Pro 

Lys Pro 

85 
Asn Ser 
100 

Tyr Thr 

Ala Leu 

Val Trp 

Gly Asp 
165 
Phe Ala 
180 

Glu Arg 



Leu Cys Ala Val 
Gin Glu 
Tyr Leu 



Ala Thr 
Pro Asn 
Gin Asn Phe Lys 



Ser Leu 

55 
He Thr 
70 

Arg Cys 

Pro Lys 

Arg Asp 

Asn Met 
135 
Gly Thr 
150 

Ser Tyr 

Pro Gly 

Trp Thr 

His Glu 
215 
Ala Val 
230 

Leu Ser 



Ala Gly 

25 
Lys Arg 
40 

Glu Ala 



Gly Met 

Gly Val 

Trp Thr 
105 
Leu Pro 
120 

Trp Gly 

Ala Asp 

Pro Phe 

Thr Gly 
185 
Asp Gly 
200 

Leu Gly 
Met Tyr 
Gin Asp 



Cys Leu Leu 
10 

Gly Met Ser 

Phe Tyr Leu 

Lys Leu Lys 

60 . 

Leu Asn Ser 
75 

Pro Asp Val 
90 

Ser Lys Val 

His He Thr 

Lys Glu He 
140 

He Met He 

155 . 
Asp Gly Pro 
170 

Leu Gly Gly 

Ser Ser Leu 

His Ser Leu 
220 

Pro Thr Tyr 

235 
Asp He Lys 



Pro Gly 

Glu Leu 

30 
Tyr Asp 
45 

Glu Met 

Arg val 

Ala Glu 

Val Thr 
110 
Val Asp 
125 

Pro Leu 

Gly Phe 

Gly Asn 

Asp Ala 
190 
Gly He 
205 

Gly Met 
Gly Asn 

Gly He 



Ser Leu 
15 

Gin Trp 

Ser Glu 

Gin Lys 

He Glu 

. 80 
Tyr Ser 
95 

Tyr Arg 

Arg Leu 

Kis Phe 

Ala Arg 
ISO 
Thr Leu 
175 

His Phe 

Asn Phe 

Gly His 

Gly Asp 
240 
Gin Lys 
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245 250 
Leu Tyr Gly Lys Arg Ser Asn Ser Arg Lys Lys 
260 265 



255 



(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5B2 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE : None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;28: 
Met Ser Pro Ala Pro Arg Pro Ser Arg Cys Leu Leu Leu Pro Leu Leu 



Thr Leu Gly Thr Ala Leu Ala Ser Leu Gly Ser Ala Gin Ser Ser Ser. 

20 25 30 

Phe Ser Pro Glu Ala Trp Leu Gin Gin Tyr Gly Tyr Leu Pro Pro Gly 

35 40 45 

Asp Leu Arg Thr His Thr Gin Arg Ser Pro Gin Ser Leu Ser Ala Ala 

50 55 60 

He Ala Ala Met Gin Lys Phe Tyr Gly Leu Gin Val Thr Gly Lys Ala 
65 70 75 80 

Asp Ala Asp Thr Met Lys Ala Met Arg Arg Pro Arg Cys Gly Val Pro 

85 90 95 

Asp Lys Phe Gly Ala Glu He Lys Ala Asn Val Arg Arg Lys Arg Tyr 

100 105 110 

Ala He Gin Gly Leu Lys Trp Gin His Asn Glu He Thr Phe Cys He 

115 120 125 

Gin Asn Tyr Thr Pro Lys Val Gly Glu Tyr Ala Thr Tyr Glu Ala He 

130 135 140 

Arg Lys Ala Phe Arg Val Trp Glu Ser Ala Thr Pro Leu Arg Phe Arg 
145 ISO 155 160 

Glu Val Pro Tyr Ala Tyr He Arg Glu Gly His Glu Lys Gin Ala Asp 

165 170 175 

He Met He Phe Phe Ala Glu Gly Phe His Gly Asp Ser Thr Pro Phe 

180 185 190 

Asp Gly Glu Gly Gly Phe Leu Ala His Ala Tyr Phe Pro Gly Pro Asn 

195 200 205 

He Gly Gly Asp Thr His Phe Asp Ser Ala Glu Pro Trp Thr Val Arg 

210 t 215 220 

Asn Glu Asp Leu Asn Gly Asn Asp He Phe Leu Val 'Ala Val His Glu 
225 230 235 240 

Leu Gly His Ala Leu Gly Leu Glu His Ser Ser Asp Pro Ser Ala He 

245 250 255 

Met Ala Pro Phe Tyr Gin Trp Met Asp Thr Glu Asn Phe Val Leu Pro 

260 265 270 

Asp Asp Asp Arg Arg Gly He Gin Gin Leu Tyr Gly Gly Glu Ser Gly 

275 280 ^35 

Phe Pro Thr Lys Met Pro Pro Gin Pro Arg Thr Thr Ser Arg Pro Ser 



1 



5 



10 



15 
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20 25 30 

Leu Leu Val Leu Leu Gly Cys Leu Gly Leu Gly Val Ala Ala Glu Asp 

35. 40 45 

Ala Glu Val His Ala Glu Asn Trp Leu Arg Leu Tyr Gly Tyr Leu Pro 

50 SS 60 

Gin Pro Ser Arg His Met Ser Thr Met Arg Ser Ala Gin He Leu Ala 
65 70 75 80 

Ser Ala Leu Ala Glu Met Gin Arg Phe Tyr Gly He Pro Val Thr Gly 

85 90 , 95 

Val Leu Asp Glu Glu Thr Lys Glu Trp Met Lys Arg Pro Arg Cys Gly 

100 105 110 

Val Pro Asp Gin Phe Gly Val Arg Val Lys Ala Asn Leu Arg Arg Arg 

115 120 125 

Arg Lys Arg Tyr. Ala Leu Thr Gly Arg Lys Trp Asn Asn His His Leu 

130 135 140 

Thr Phe Ser He Gin Asn Tyr Thr Glu Lys Leu Gly Trp Tyr His Ser 
145 150 155 160 

Met Glu Ala Val Arg Arg Ala Phe Arg Val Trp Glu Gin Ala Thr Pro 

165 170 175 

Leu Val Phe Gin Glu Val Pro Tyr Glu Asp He Arg Leu Arg Arg Gin 

• 180 185 ' 190 

Lys Glu Ala Asp He Met Val Leu Phe Ala Ser Gly Phe His Gly Asp 

195 200 205 

Ser Ser Pro Phe Asp Gly Thr Gly Gly Phe Leu Ala His Ala Tyr Phe 

210 215 220 

Pro Gly Pro Gly Leu Gly Gly Asp Thr His Phe Asp Ala Asp Glu Pro 
225 230 235 240 

Trp Thr Phe Ser Ser Thr Asp Leu His Gly .Asn Asn Leu Phe Leu Val 

245 250 255 

Ala Val His Glu Leu Gly His Ala Leu Gly Leu Glu His Ser Ser Asn 

260 265 270 

Pro Asn Ala He Met Ala Pro Phe Tyr Gin Trp Lys Asp Val Asp Asn 

275 280 285 

Phe Lys Leu Pro Glu Asp Asp Leu Arg Gly He Gin Gin Leu Tyr Gly 

290 295 300 

Thr Pro Asp Gly Gin Pro Gin Pro Thr Gin Pro Leu Pro Thr Val Thr 
305 310 315 320 

Pro Arg Arg Pro Gly Arg Pro Asp His Arg Pro Pro Arg Pro Pro Gin 

325 f 330 335 

Pro Pro Pro Pro Gly Gly Lys Pro Glu Arg Pro Pro Lys Pro Gly Pro 

340 ... 34S 350 

Pro Val Gin Pro Arg Ala Thr Glu Arg Pro Asp Gin Tyr Gly Pro Asn 

355 360 365 

He Cys Asp Gly Asp Phe Asp Thr Val Ala Met Leu Arg Gly Glu Met 

370 375 380 

Phe Val Phe Lys Gly Arg Trp Phe Trp Arg Val Arg His Asn Arg Val 
385 390 395 400 

Leu Asp Asn Tyr Pro Met Pro He Gly His Phe Trp Arg Gly Leu Pro 

405 410 415 

Gly Asp He Ser Ala Ala Tyr Glu Arg Gin Asp Gly Arg Phe Val Phe 

-i:o 425 430 

Phe Lys Gly Asp Arg Tyr Trp Leu Phe Arg Glu Ala Asn Leu Glu Pro 
435 440 445 
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Gly Tyr Pro Gin Pro Leu Thr Ser Tyr Gly Leu Gly lie Pro Tyr Asp 

450 4S5 46Q 

Arg lie Asp Thr Ala He Trp Trp Glu Pro Thr Gly His Thr Phe Phe 
465 47 ° 475 400 

Phe Gin Glu Asp Arg Tyr Trp Arg Phe Asn Glu Glu Thr Gin Arg Gly 

485 490 495 

Asp Pro Gly Tyr Pro Lys Pro He Ser Val Trp Gin Gly He Pro Ala 

500 505 510 

Ser Pro Lys Gly Ala Phe Leu Ser Asn Asp Ala Ala Tyr Thr Tyr Phe 

515 S20 S25 

Tyr Lys Gly Thr Lys Tyr Trp Lys Phe Asp Asn Glu Arg Leu Arg Met 

530 535 540 

Glu Pro Gly Tyr Pro Lys Ser He Leu Arg Asp Phe Met Gly Cys Gin 
545 550 555 5S0 

Glu His Val Glu Pro Gly Pro Arg Trp Pro Asp Val Ala Arg Pro Pro 

56S 570 575 

Phe Asn Pro His Gly Gly Ala Glu Pro Gly Ala Asp Ser Ala Glu Gly 

580 585 590 

Asp Val Gly Asp Gly Asp Gly Asp Phe Gly Ala Gly Val Asn Lys Asp 

59S 600 60S 

Gly Gly Ser Arg Val Val Val Gin Met Glu Glu Val Ala Arg Thr Val 

610 615 620 

Asn Val, Val Met Val Leu Val Pro Leu Leu Leu Leu Leu Cys Val Leu 
625 630 635 640 

Gly Leu Thr Tyr Ala Leu Val Gin Met Gin Arg Lys Gly Ala Pro Arg 

645 650 655 

Val Leu Leu Tyr Cys Lys Arg Ser Leu Gin Glu Trp Val 
660 665 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 604 amino acids 
(B> TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met He Leu Leu Thr Phe Ser Thr Gly Arg Arg Leu Asp Phe Val His 

1 5 io is 

His Ser Gly Val Phe Phe Leu Gin Thr Leu Leu Trp lie Leu Cys Ala 

20 25 30 

Thr Val Cys Gly Thr Glu Gin Tyr Phe Asn Val Glu Val Trp Leu Gin 

35 40 * 45 

Lys Tyr Gly Tyr Leu Pro Pro Thr Ser Pro Arg Met Ser Val Val Arg 

SO 55 60 

Ser Ala Glu Thr Met Gin Ser Ala Leu Ala Ala Met Gin Gin Phe Tyr 
65 70 75 SO 

Gly He Asn Met Thr Gly Lys Val Asp Arg Asn Thr He Asp Trp Met 
85 90 95 
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Lys Lys Pro Arg Cys Gly Val Pro Asp Gin Thr Arg Gly Ser Ser Lys 

100 105 no 

Phe His He Arg Arg Lys Arg Tyr Ala Leu Thr Gly Gin Lys Trp Gin 

115 - 120 12S 

His Lys His He Thr Tyr Ser He Lys Asn Val Thr Pro Lys Val Gly 

130 135 140 

Asp Pro Glu Thr Arg Lys Ala lie Arg Arg Ala Phe Asp Val Trp Gin 
145 150 155 160 

Asn Val Thr Pro Leu Thr Phe Glu Glu Val Pro Tyr Ser Glu Leu Glu 

165 170 175 

Asn Gly Lys Arg Asp Val Asp He Pro He He Phe Ala Ser Gly Phe 

180 185 190 

His Gly Asp Ser Ser Pro Phe Asp Gly Glu Gly Gly Phe Leu Ala His 

19S 200 20S 

Ala Tyr Phe Pro Gly Pro Gly He Gly Gly Asp Thr His Phe Asp Ser 

210 215 220 

Asp Glu Pro Trp Thr Leu Gly Asn Pro Asn His Asp Gly Asn Asp Leu 
225 230 235 240 

Phe Leu Val Ala Val His Glu Leu Gly His Ala Leu Gly Leu Glu His 

245 250 255 

Ser Asn Asp Pro Thr Ala He Met Ala Pro Phe Tyr Gin Tyr Met Glu 

260 265 270 

Gin Thr Leu Gin Leu Pro Asn Asp Asp Tyr Arg His Gin Arg Tyr Met 

275 280 285 

Ser Pro Asp Lys lie Pro Pro Pro Thr Arg Pro Leu Pro Thr Val Pro 

290 295 300 

Pro His Arg Ser He Pro Pro Ala Asp Pro Arg Lys Asn Asp Arg Pro 
305 310 315 320 

Lys Pro Pro Arg Pro Pro Thr Gly Arg Pro Ser Tyr Pro Gly Ala Lys 

325 330 335 

Pro Asn He Cys Asp Gly Asn Phe Asn Thr Leu Ala He Leu Arg Arg 

340 345 350 ■ 

Glu Met Phe Val Phe Lys Asp Gin Trp Phe Trp Arg Val Arg Asn Asn 

355 360 365 

Arg Val Met Asp Gly Tyr Pro Met Gin He Thr Tyr Phe Trp Arg- Gly 

370 375 380 

Leu Pro Pro Ser He Asp Ala Val Tyr Glu Asn Ser Asp Gly Asn Phe 
385 390 395 400 

Val Phe Phe Lys Gly Asn Lys Tyr Trp Val Phe Lys Asp Thr Thr Leu 

405 • 410 415 

Gin Pro Gly Tyr Pro His Asp Leu He Thr Leu Gly Ser Gly He Pro 

420 425 430 

Pro His Gly He Asp Ser Ala He Trp Trp Glu Asp Val Gly Lys Thr 

435 440 445 

Tyr Phe Phe Lys Gly Asp Arg Tyr Trp Arg Tyr Ser Glu Glu Met Lys 

450 455 460 

Thr Met Asp Pro Gly Tyr Pro Lys Pro He Thr Val Trp Lys Gly He 
465 470 475 490 

Pro Glu Ser Pro Gin Gly Ala Phe Val His Lys Glu Asn Gly Phe Thr 

485 490 495 

Tyr Phe Tyr Lys Glu Gly Val Leu Glu He Gin Thr Thr Arg Tyr Ser 

500 505 510 

Arg Leu Glu Pro Gly His Pro Arg Ser He Leu Lys Asp Leu Ser Gly 
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515 520 
Cys Asp Gly Pro Thr Asp Arg Val 

530 535 
Asp Val Asp He Val He Lys Leu 
545 5S0 
Ala He Ala He Val He Pro Cys 
' 565 

Leu Val Tyr Thr Val Phe Gin Phe 
580 

He Leu Tyr Cys Lys Axg Ser Met 
595 600 



64 

525 

Lys Glu Gly His Ser Pro Pro Asp 
540 

Asp Asn Thr Ala Ser Thr Val Lys 

555 560 ' 

He Leu Ala Leu Cys Leu Leu Val 

570 S75 
Lys Arg Lys Gly Thr Pro Arg His 
535 590 
Gin Glu Trp Val 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Ala Pro Ala Ala Trp Leu Arg Ser Ala Ala Ala Arg Ala Leu Leu 

I 5 10 15 

Pro Pro Met Leu Leu Leu Leu Leu Gin Pro- Pro Pro Leu Leu Ala Arg 

20 25 30 

Ala Leu Pro Pro Asp Val His His Leu His" Ala Glu Arg Arg Gly Pro 

35 40 .45 

Gin Pro Trp His Ala Ala Leu Pro Ser Ser Pro Ala Pro Ala Pro Ala 

50 55 60 

Thr Gin Glu Ala Pro Arg Pro Ala Ser Ser Leu Arg Pro Pro Arg Cys 
65 70 75 80 

Gly Val Pro Asp Pro Ser Asp Gly Leu Ser Ala Arg Asn Arg Gin Lys 

85 90 95 

Arg Phe Val Leu Ser Gly Gly Arg Trp Glu Lys Thr Asp Leu Thr Tyr 

100 105 110 

Arg He Leu Arg Phe Pro Trp Gin Leu Val Gin Glu Gin Val Arg Gin 

115 120 125 

Thr Met Ala Glu Ala Leu Lys Val Trp Ser Asp Val Thr Pro Leu Thr 

130 135 140 

Phe Thr Glu Val His Glu Gly Arg Ala Asp He Met He Asp Phe Ala 
145 150 155 . 160 

Arg Tyr Trp Asp Gly Asp Asp Leu Pro Phe Asp Gly Pro Gly Gly He 

165 170 175 

Leu Ala His Ala Phe Phe Pro Lys Thr His Arg Glu Gly Asp Val His 

180 185 190 

Phe Asp Tyr Asp Glu Thr Trp Thr He Gly Asp Asp Gin Gly Thr Asp 

195 200 20S 

Leu Leu Gin Val Ala Ala His Glu Phe Gly^His Val Leu Gly Leu Gin 

210 21S . 220 

His Thr Thr Ala Ala Lys Ala Leu Met Ser Ala Phe Tyr Thr Phe Arg 
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22 5 










230 


Tyr 


Pro 


Leu 


Ser 


Leu 


Ser 










24S 




Tyr 


Gly 


Gin 


Pro 


Trp 


Pro 








260 






Pro 


Gin 


Ala 


Gly 


He 


Asp 






275 








Ala 


Pro 


Pro 


Asp 


Ala 


Cys 




290 










Arg Gly Glu 


Leu 


Phe 


Phe 


305 










310 


Gly Gly 


Gin 


Leu 


Gin 


Pro 










325 




Gin Gly 


Leu 


Pro 


Ser 


Pro 








340 






Hi9 


lie 


Trp 


Phe 


Phe 


Gin 






355 








Lys 


Pro 


Val 


Leu Gly 


Pro 




370 










Phe 


Pro 


Val 


His 


Ala 


Ala 


385 










390 


Tyr 


Phe 


Phe 


Arg 


Gly Arg 










405 




Arg 


Val 


Asp 


Ser 


Pro 


Val 








420 






Pro 


Ser 


Glu 


lie 


Asp 


Ala 






435 








Phe 


Leu 


Arg 


Gly Arg 


Leu 




450 










Ala 


Leu 


Glu 


Gly 


Phe 


Pro 


465 










470 


Ala 


Glu 


Pro 


Ala 


Asn 


Thr 










485 





235 240 
Pro Asp Asp Cys Arg Gly Val Gin His Leu 

250 255 
Thr Val Thr Ser Arg Thr Pro Ala Leu. Gly 

265 270 
Thr Asn Glu He Ala Pro Leu Glu Pro Asp 

280 285 
Glu Ala Ser Phe Asp Ala Val Ser Thr lie 
295 300 

Phe Lys Ala Gly Phe Val Trp Arg Leu Arg 
315 320 
Gly Tyr Pro Ala Leu Ala Ser Arg His Trp 

330 335 
Val Asp Ala Ala Phe Glu Asp Ala Gin Gly 

34S 350 
Gly Ala Gin Tyr Trp Val Tyr Asp Gly Glu 

360 365 
Ala Pro Leu Thr Glu Leu Gly Leu Val Axg 
375 380 

Leu Val Trp Gly Pro Glu Lys Asn Lys He 
395 400 
Asp Tyr Trp Arg Phe His Pro Ser Thr Arg 

410 415 
Pro Arg Arg Ala Thr Asp Trp Arg Gly Val 

425 430 
Ala Phe Gin Asp Ala Asp Gly Tyr Ala Tyr 

440 445 
Tyr Trp Lys Phe Asp Pro Val Lys Val Lys 
455 460 
Arg Leu Val Gly Pro Asp Phe Phe Gly Cys 
475 480 



(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 517 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
ID) TOPOLOGY: linear 



tii) MOLECULE TYPE: None 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Gin Gin Phe Gly Gly Leu Glu Ala Thr Gly He Asp Glu Ala Thr 

1 5 - 10 15 

Leu Ala Leu Met Lys Thr Pro Arg Cys Ser Leu Pro Asp Leu Pro Val 

20 25 30 

Leu Thr Gin Ala Arg Arg Arg Arg Gin Ala Pro Ala Pro Thr Lys Trp 

35 40 45 

Asn Lys Arg Asn Leu Ser Trp Arg Val Arg Tnr Phe Pro Arg Asp Ser 
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50 










55 










60 










Pro 


Leu 


Gly 


His 


Asp 


Thr 


Val 


Arg 


Ala 


Leu 


Met 


Tyr 


Tyr 


Ala 


Leu 


Lys 


65 










70 










75 










eo 


Val 


Trp 


Ser 


Asp 


He 
85 


Ala 


Pro 


Leu 


Asn 


Phe 
90 


His 


Glu 


Val 


Ala 


Gly 
95 


Ser 


Thr 


Ala 


Asp 


lie 
100 


Gin 


He 


Asp 


Phe 


Ser 
105 


Lys 


Ala 


Asp 


His 


Asn 
110 


Asp 


Gly 


Tyr 


Pro 


Phe 


Asp 


Ala 


Arg Arg 


His 


Arg 


Ala 


His 


Ala 


Phe 


Phe 


Pro Gly 




115 










120 










125 








His 


His 


His 


Thr 


Ala 


Gly Tyr 


Thr 


His 


Phe 


Asn 


Asp 


Asp 


Glu 


Ala 


Trp 




130 










135 










140 










Thr 


Phe 


Arg 


Ser 


Ser 


Asp Ala His Gly Met Asp 


Leu 


Phe 


Ala 


Val 


Ala 


145 








150 










155 










160 


Val 


His 


Glu 


Phe Gly 


His 


Ala 


He 


Gly Leu 


Ser 


His 


Val 


Ala 


Ala 


Ala 










165 










170 










175 




His 


Ser 


He 


Met 


Arg 


Pro 


Tyr Tyr 


Gin Gly 


Pro 


Val 


Gly 


Asp 


Pro 


Leu 








ISO 










185 










190 






Arg 


Tyr 


Gly 


Leu 


Pro 


Tyr Glu Asp Lys 


Val 


Arg 


Val 


Trp 


Gin 


Leu Tyr 


195 










200 










205 








Gly val 


Arg 


Glu 


Ser 


Val 


Ser 


Pro 


Thr 


Ala 


Gin 


Pro 


Glu 


Glu 


Pro 


Pro 




210 










215 










220 










Leu 


Leu 


Pro 


Glu 


Pro 


Pro 


Asp 


Asn 


Arg 


Ser 


Ser 


Ala 


Pro 


Pro 


Arg 


Lys 


225 










230 










235 










240 


Asp 


val 


Pro 


His 


Arg 


Cys 


Ser 


Thr 


His 


Phe 


Asp 


Ala 


Val 


Ala 


Gin 


He 








245 










250 














Arg Gly 


Glu 


Ala 


Phe 


Phe 


Phe 


Lys 


Gly 


Lye 


Tyr 


Phe 


Trp 


Arg 


Leu 


Thr 








260 










265 










270 






Arg 


Asp 


Arg 


His 


Leu 


val 


Ser 


Leu 


Gin 


Pro 


Ala 


Gin 


Met 


His 


Arg 


Phe 


275 










280 










285 








Trp 


Arg 


Gly 


Leu 


Pro 


Leu 


His 


Leu 


Asp 


Ser 


Val 


Asp 


Ala 


Val 


Tyr 


Glu 


290 










295 










300 










Arg 


Thr 


Ser 


Asp 


His 


Lys 


He 


Val 


Phe 


Phe 


Lys 


Gly 


Asp 


Arg 


Tyr 


Trp 


305 










310 










315 










320 


Val 


Phe 


Lys 


Asp 


Asn 


Asn 


Val 


Glu 


Glu 


Gly Tyr 


Pro 


Arg 


Pro 


Val 


Ser 










325 










330 










335 




Asp 


Phe 


Ser 


Leu 


Pro 


Pro Gly Gly 


He 


Asp 


Ala 


Ala 


Phe 


Ser 


Trp 


Ala 






340 










345 










350 






His 


Asn 


Asp 
355 


Arg 


Thr 


Tyr 


Phe 


Phe 
360 


Lys 


Asp 


Gin 


Leu 


Tyr 
365 


Trp 


Arg 


Tyr 


Asp 


Asp 


His 


Thr Arg 


His 


Met 


Asp 


Pro Gly Tyr 


Pro 


Ala 


Gin 


Ser 


Pro 




370 










375 










380 










Leu 


Trp 


Arg 


Gly Val 


Pro 


Ser 


Thr 


Leu Asp Asp 


Ala 


Met 


Arg 


Trp 


Ser 


385 










390 










395 










400 


Asp Gly 


Ala 


Ser 


Tyr 


Phe 


Phe Arg Gly Gin 


Glu 


Tyr 


Trp 


Lys 


Val 


Leu 










405 










410 










415 




Asp 


Gly 


Glu 


Leu 
420 


Glu 


val 


Ala 


Pro 


Gly 
425 


Tyr 


Pro 


Gin 


Ser 


Thr 
430 


Ala 


Arg 


Asp 


Trp 


Leu 


Val 


Cys 


Gly Asp 


Ser 


Gin 


Ala 


Asp 


Gly 


Ser 


Val 


Ala 


Ala 






435 










440 










445 








Gly 


Val 
450 


Asp 


Ala 


Ala 


Glu 


Gly 
455 


Pro 


Arg 


Ala 


Pro 


Pro 
460 


Gly 


Gin 


His 


Asp 


Gin 


Ser 


Arg 


Ser 


Glu 


Asp Gly Tyr Glu 


Val 


Cys 


Ser 


Cys 


•Thr 


Ser Gly 


465 










470 










475 










480 
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Ala Ser Ser Pro Pro Gly Ala Pro Gly Pro Leu Val Ala Ala Thr Met 

485 490 495 

Leu Leu Leu Leu Pro Pro Leu Ser Pro Gly Ala Leu Trp Thr Ala Ala 

500 SOS 510 

Gin Ala Leu Thr Leu 
515 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 

. (S) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 



Arg Xaa Lys Arg 
1 
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What is claimed: 

1 . A purified polynucleotide or fragment thereof derived from human matrix 
metalloproteinase 19 (MMP19) gene wherein said purified polynucleotide is capable of 
selectively hybridizing to the nucleic acid of said MMP19 gene, and wherein said purified 
polynucleotide has at least 50% identity to SEQ ID NO: 1 and fragments, analogs or 
complements thereof. 

. 2. The purified polynucleotide of Claim 1 wherein said purified polynucleotide is 
produced by recombinant techniques. 

3 . " The purified polynucleotide of Claim 2 wherein said polynucleotide 
produced by recombinant techniques comprises a sequence of at least one epitope encoded 
byMMP19. 

4. . The purified polynucleotide of Claim 3 wherein said epitope is selected 
from the group consisting of SEQ ID NO: 14. SEQ ED NO: 15, SEQ ID NO: 16, and SEQ 
ID NO: 17. 

5 . A purified polynucleotide of MMP19 which encodes an MMP19 protein 
comprising an amino acid sequence which has at least 40% identity to SEQ ID NO: 10. 

6. The purified polynucleotide of Claim 5 which encodes an MMP19 protein 
having SEQ ID NO: 10. 

7 . A recombinant expression system comprising a nucleic acid sequence that 
encodes an open reading frame derived from MMP19 which is opcrably linked to a control 
sequence compatible with a desired host and wherein said nucleic acid sequence is capable 
of selectively hybridizing to the nucleic acid of said MMP19 gene, and wherein said 
purified polynucleotide has at least 50% identity to SEQ ID NO: 1 and fragments, analogs 
or complements thereof. 

8 . A host cell comprising the recombinant expression system of Claim 7. 

9 . An MMP 1 9 polypeptide wherein said polypeptide has at least 40% identity 
to SEQ ID NO: 10 and fragments thereof. 
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10. The polypeptide of Claim 9 which further possesses the ability to cleave 
SEQIDNO: 13. 

1 1 . The polypeptide of Claim 9 selected from the group consisting of SEQ ID 
NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, and SEQ ID NO: 17. 

12. A method of detecting the presence of a target polynucleotide of MMP 1 9 in 
a test sample, comprising: 

(a) contacting said test sample with at least one MMP 1 9 specific polynucleotide 
or complement thereof; and 

(b ) detecting the presence of said target polynucleotide of MMP 19 in the test 

sample; 

wherein said MMP 19 specific polynucleotide has at least 50% identity to polynucleotide 
SEQ ID NO: 1 and fragments, analogs or complements thereof. 

! 3. The method of Claim 14 wherein said target polynucleotide of MMP 19 is 
attached to a solid phase prior to performing step (a). 

14. A method for detecting mRN A of MMP 19 in a test sample, comprising: 

(a) contacting said test sample with at least one primer; 

(b) performing reverse transcription in order to produce cDNA; 

(c) amplifying said cDNA obtained from step (b) by using other 
oligonucleotide primer(s) of MMP19 as sense and antisense primer(s) in a first-stage 
amplification to obtain MMP 19 amplicon; 

(d) detecting the presence of said MMP 1 9 ampiicon in the test sample; 
wherein said oligonucleotide primers of MMP 19 have at least 50% identity to SEQ ID NO: 
1 and fragments, analogs or complements thereof. 

15. A method of detecting target MMP 19 polynucleotide in a test sample 
suspected of containing said target, comprising: 

(a) contacting said target MMP1 9 polynucleotide with at lease on MMP 19 
oligonucleotide as sense primer and with at least one MMP 19 oligonucleotide as an 
antisense primer and amplifying same to obtain a first stage reaction product; 

(b) contacting said first stage reaction product with at least one other MMP 19 
oligonucleotide, with the proviso that the other MMP 19 oligonucleotide is located 3' to the 
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MMP 19 oligonucleotides utilized in step (a) and is complementary to said first stage 
reaction product; and 

(c) detecting said target MMP 1 9 polynucleotide, 
wherein said MMP 1 9 oligonucleotides utilized in step (a) and step (b) have at least 50% 
identity to SEQ ID NO: 1 and fragments, analogs or complements thereof. 

16. The method of Claim 14 or Claim 15 wherein said test sample is reacted 
with a solid phase prior to performing step (a) or step (b) or step (c). 

1 7 . The method of Claim 1 4 or Claim 1 5 wherein said detection step comprises 
utilizing a detectable label capable of generating a measurable signal. 

18. A method for identifying an inhibitor compound of MMP 1 9 comprising the 
steps of: 

(a) providing a reaction mixture comprising 

(i) a substrate; 

(ii) an MMP19 protein having at least 40% identity to SEQ ID NO: 10 or 
fragments thereof, said MMP 19 protein further possessing the ability to cleave said 
substrate; and 

(iii) a compound of interest; 

(b) incubating the reaction mixture; and 

(c) determining the extent of cleavage of the substrate in the reaction mixture. 

19. . The method of Claim 1 8 wherein said substrate is SEQ ID NO: 1 3. 

20. The method of Claim 19 wherein said MMP 19 protein is SEQ ID NO: 10." 

21. An antibody which specifically binds to at least one epitope encoded by 
MMP 1 9, wherein said antibody is polyclonal or monoclonal and wherein said epitope 
comprises an amino acid sequence selected from the group consisting of SEQ ID NO: 14. 
SEQ ID NO: 15, SEQ ID NO: 16, and SEQ ID NO: 17. 

22. An assay kit for determining the presence of MMP 19 antigen or antibody in 
a test sample, comprising a container containing an antibody which specifically bins to 
MMP 19 anLigen, wherein said antigen comprises at least one epitope of MMP 19 having at 
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least a sequence selected from the group consisting of SEQ ID NO: 14 ? SEQ ID NO: 1 5, 
SEQ ID NO: 16, and SEQ ID NO: 17. 

23 . The kit of Claim 28 wherein said antibody is attached to a solid phase. 

24. A test kit useful for detecting MMP1 9 polynucleotide in a test sample, 
comprising a container containing at least one MMP19 polynucleotide which is capable of 
selectively hybridizing to the nucleic acid of MMP19 gene and wherein said purified 
polynucleotide has at least 50% identity to SEQ ID NO: 1 an fragments, analogs or 
complements thereof. 
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FIG . 1 

xxxxxxxxxxxxxxxxxxxLxxLLLLxxxxxxxLxxxxxxxxxxxxxxxxxxxxxYLxxYYxLx 
xxxxxxxxxxxxxxxxxxxLxxMOxFFGLxVTGKLDxxTLExHx KPRCGVPD xxxxxxxxxVxx 

FxLxPxxPKWxxxxxTYRIxNYTPDLxxxxVDxAIxKAFxVWSxVTPLxFxxVxxxxxxxxxEG 
xADIMIxFAxxEHGDxxPFDGPGGxLAHAFxPGPGIGGDAHFDDDExWTxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxGxNLFLVA AHExGHSLGLxHS xDPxAL 

MYPxYxxFxDxxxFxLxxDDIxGIQxLYGxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxPxxCDxxxxxxxxxxxxFDAIxxxRGExFFFKDRFFWRxx 
xxxxxxxxxxxLIxxFWPxLPxxIDAAYExxxxxxVFFFKGxxYWxYxGxxxxxGYPxxIxxxL 
GFPxxVxxIDAAxxxxxxxxKTYFFxxxxYWRYDExxxxMDPGYPKxIxxxFxGIxxxVDAVFx 
xxxxGFxYFFxGxxxYxFxxxxxxVxxxxxxxxxxxWLxCxxxxxxxxxxxxxxxxxxxxxxxx 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

FIG. 2 

1 50 
***** 

CCTNCACTNGATTNAGCGATGCCATNAGAGCGTTTNAGTGGGTGTNCCAG 
XTXXSDAXRAFXWVXQ 

51 100 

CTACCTGTNAGCGGCGTGTTGGACCGCGCCACCCTGCGCCAGATGANTNG 
LPVSGVLDRATLRQMXX 

101 150 

* * * * * 

TCCCCGGTGCGGGGTTACAGATACCAACAGTTATGNGGCCTGGGNTGAGA 
P R C G V T D TNSYXAWXE 

151 200 

* - * * * * 
GG ATC AG TG ANTT GTTTGC T AG AC AAC GG ACC AAAATG AGGCG T AAG AAA 
RISXLFARQRTKMR R K K 

201 250 
***** 
CGNTTTGCAAAGCAAGGTAACAAATGGTANAAGCAGCACCTNTTCTACCG 
_R FA'KQGNKWXKQHLFYR 

251 300 



NNTGGTGAACTGGGCTGAGNNTGTGCCGGAGCCGGCAGTTGGGGGCGCG 
XVNWAEXVPEPAVGGA 
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FIG . 3 



241 314 396 



BE3 1,036 1 . 1 92 1.322 1.430 1.5S5 I. TP 
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FIG. 4(a) 



pro-peptide 

AAAGCTCCCACCTCCACTCGATTCAGCGATGCCATCAGAGCGTTTCAGTG 
TTTCGAGGGTGGAGGTGAGCTAAGTCGCTACGGTAGTCTCGCAAAGTCAC 
KAPTSTRFSDAIRAFQW 



GGTGTCCCAGCTACCTGTCAGCGGCGTGTTGGACCGCGCCAACCTGCGCC 
CCACAGGGTCGATGGACAGTCGCCGCACAACCTGGCGCGGTTGGACGCGG 
VSQLPVSGVLDRANLR 



AGATGAC7CGTCCCCGCTGCGGGGTTACAGATACCAACAGTTATGCGGCC 
TCTACTGAGCAGGGGCGACGCCCCAATGTCTATGGTTGTCAATACGCCGG 
0 M T R P R C G V T D T N S Y A A 



TGGGC TGAGAGG ATC AGTGACTTGTTTGCTAG AC ACCGGAC C AAAATGAG 
AC CCG ACTCTC C TAGTC ACTGAAC AAACGATC TGTGGCC TGGTTTTACTC 
WAERIS.DLFARHRTKMR 

catalytic domain 

GCGTAAGAAACGCTTTGCAAAGCAAGGTAACAAATGGTACAAGCAGCACC 
CGCATTCTTTGCGAAACGTTTCGTTCCATTGTTTACCATGTTCGTCGTGG 
R K K R FAKQGNKWYKQH 



TCTCCTACCGCCTGGTGAACTGGCCTGAGCATCTGCCGGAGCCGGCAGTT 
AGAGGATGGCGGACCACTTGACCGGACTCGTAGACGGCCTCGGCCGTCAA 
LSYRLVNWPEHLPSPAV 



CGGGGCGCCGTGCGCGCCGCCTTCCAGTTGTGGAGCAACGTCTCAGCGCT 
GCCCCGCGGCACGCGCGGCGGAAGGTCAACACCTCGTTGCAGAGTCGCGA 
RGAVRA'AFQLWSNVSAL 



GGAGTTCTGGGAGGCCCCAGCCACAGGCCCCGCTGACATCCGGCTCACCT 
CCTCAAGACCCTCCGGGGTCGGTGTCCGGGGCGACTGTAGGCCGAGTGGA 
EFWEAPATGPADI RLT 



TCTTCCAAGGGGACCACAACGATGGGCTGGGCAATGCCTTTGATGGCCCA 
AGAAGGTTCCCCTGGTGTTGCTACCCGACCCGTTACGGAAACTACCGGGT 
FFQGDHNDGLGNAFDGP 



GGGGGCGCCCTGGCGCACGCCTTCCTGCCCCGCCGCGGCGAAGCGCACTT 
CCCCCGCGGGACCGCGTGCGGAAGGACGGGGCGGCGCCGCTTCGCGTGAA 
GGALAHAFLPF. RGEAHF 



CGACCAAGATGAGCGCTGGTCC- . .iAGCCGCCCCCGCGGGCGCAACCTGT 
GCTGGTTCTACTCGCGACCAGGGACTCGGCGGCGGCGCCCGCGTTGGACA 
DQDERWSLSRRRGRNL 
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FIG. 4(b) 

TCGTGGTGCTGGCGCACGAGATCGGTCACACGCTTGGCCTCACCCACTCG 
AGCACCACGACCGCGTGCTCTAGCCAGTGTGCGAACCGGAGTGGGTGAGC 
F V V L A HEIGHTLGLTHS 



CCCGCGCCGCGCGCGCTCATGGCGCCCTACTACAAGAGGCTGGGCCGCGA 
GGGCGCGGCGCGCGCGAGTACCGCGGGATGATGTTCTCCGACCCGGCGCT 
PAPRALMAPY.YKRLGRD 



CGCGCTGCTCAGCTGGGACGACGTGCTGGCCGTGCAGAGCCTGTATGGGA 
GCGCGACGAGTCGACCCTGCTGCACGACCGGCACGTCTCGGACATACCCT 
ALLSWDDVLAVQS LYG 

hinge region 

agcccctagggggctcagtggccgtccagctcccaggaaagctgttcact 
tcggggatcccccgagtcaccggcaggtcgagggtcctttcgacaagtga 
kplggsva'vqlpgklft 



GACTTTGAGACCTGGGACTCCTACAGCCCCCAAGGAAGGCGCCCTGAAAC 
CTGAAACTCTGGACCCTGAGGATGTCGGGGGTTCCTTCCGCGGGACTTTG 
DFETWDSY.SPQGR R' P ■ E T 

hemopexin domain 

GCAGGGCCCTAAATACTGCCACTCTTCCTTCGATGCCATCACTGTAGACA 
CGTCCCGGGATTTATGACGGTGAGAAGGAAGCTACGGTAGTGACATCTGT 
QGPKYCHSSFDAITVD 



GGCAACAGCAACTGTACATTTTTAAAGGGAGCCATTTCTGGGAGGTGGCA 
CCGTTGTCGTTGACATGTAAAAATTTCCCTCGGTAAAGACCCTCCACCGT 
RQQQLYIFKGS.HFWEVA 



GCTGATGGCAACGTCTCAGAGCCCCGTCCACTGCAGGAAAGATGGGTCGG 
CGACTACCGTTGCAGAGTCTCGGGGCAGGTGACGTCCTTTCTACCCAGCC 
ADGNVSEPRPLQ. ERWVG 



G CTG C CCC C C AAC ATTG AGGC TGCGGCAGTGTC ATTG AATGATGGAGATT 
CGACGGGGGGTTGTAACTCCGACGCCGTCACAGTAACTTACTACCTCTAA 
LPPNI EAAAVSLNDGD 



TCTACTTCTTCAAAGGGGGTCGATGCTGGAGGTTCCGGGGCCCCAAGCCA 
■AGATGAAGAAGTTTCCCCCAGCTACGACCTCCAAGGCCCCGGGGTTCGGT 
FYFFKGGRCWRFRGPKP 



GTGTGGGGTCTCCCACAGCTGTGCCGGGCAGGGGGCCTGCCCCGCCATCC 
CACACCCCAGAGGGTGTCGACACGGCCCGTCCCCCGGACGGGGCGGTAGG 
VWGLPQLCRAGGLPRHP 



TGACGCCGCCCTCTTCTTCCCTCCTCTGCGCCGCCTCATCCTCTTCAAGG 
ACTGCGGCGGGAGAAGAAGGGAGGAGACGCGGCGGAGTAGGAGAAGTTCC 
DAALFFPPLRRLILFK 
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FIG. 4(c) 

GTGCCCGCTACTACGTGCTGGCCCGAGGGGGACTGCAAGTGGAGCCCTAC 1200 
CACGGGCGATGATGCACGACCGGGCTCCCCCTGACGTTCACCTCGGGATG 
GARYYVLARGG LQVEPY 

TACCCCCGAAGTCTGCAGGACTGGGGAGGCATCCCTGAGGAGGTCAGCGG 1250 
ATGGGGGCT7CAGACGTCCTGACCCCTCCGTAGGGACTCCTCCAGTCGCC 
YPRSLQDWGGI PEEVSG 



CGCCCTGCCGAGGCCCGATGGCTCCATCATCTTCTTCCGAGATGACCGCT 13 00 

GCGGGACGGCTCCGGGCTACCGAGGTAGTAGAAGAAGGCTCTACTGGCGA 
ALPRPDGSIIFFRDDR 



ACTGGCGCCTCGACCAGGCCAAACTGCAGGCAACCACCTCGGGCCGCTGG 13 50 

TGACCGCGGAGCTGGTCCGGTTTGACGTCCGTTGGTGGAGCCCGGCGACC 

YWRLDQAKLQATTSGRW 



GCCACCGAGCTGCCCTGGATGGGCTGCTGGCATGCCAACTCGGGGAGCGC 1400 
CGGTGGCTCGACGGGACCTACCCGACGACCGTACGGTTGAGCCCCTCGCG 
ATELPWMGCWHAMSGSA 



CCTGTTCTGAAGGCACCTCCTCACCTCAGAAACTGGTGGTGCTCTCAGGG 1450 
GGACAAGACTTCCGTGGAGGAGTGGAGTCTTTGACCACCACGAGAGTCCC 
L F * 

CAAAATCATGTTCCCCACCCCCGGGGCAGAACCCCTCTTAGAAGCCTCTG 1500 
GTTTTAGTACAAGGGGTGGGGGCCCCGTCTTGGGGAGAATCTTCGGAGAC 

AGTCCCTCTGCAGAAGACCGGGCAGCAAAGCCTCCATCTGGAAGTCTGTC 15 50 

TCAGGGAGACGTCTTCTGGCCCGTCGTTTCGGAGGTAGACCTTCAGACAG 

TGCCTTTGTTCCTTGAAGAATGCAGCATTGTCTTTGTCTGTCCCCACCAC 1600 
ACGGAAACAAGGAACTTCTTACGTCGTAACAGAAACAGACAGGGGTGGTG 

ATGGAGGTGGGGGTGGGATCAATCTTAGGAAAAGCMMMGGGTCCCAG 1650 
TACCTCCACCCCCACCCTAGTTAGAATCCTTTTCGTTTTTTCCCAGGGTC 

ATCCCTTGGCCCTTTCCTCCGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 1700 
TAGGGAACCGGGAAAGGAGGCTTTTTTTTTTTTTTTTTTTTTTTTTTTTT 



AAAAAAAAAAAAAAAAA 



WO 98/40475 



PCT/US98/04694 



6/16 

FIG. 5(a) 



50 



KMP19 

Macrilysin 

Gelatinase A 

Gelatinase B 

Stromelysin 1 

Stromelysin 

Stromelysin 

Collagenase 

Collagenase 

Collagenase 

MMP12 

MMP18 

mtlMMP 
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mt3MMP 
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MRLTVLC AVCL.LPGSL ALPLPQEAGG 

MEALMARG ALTGPLRALC LLGCLLSKA. AAAPSPIIKF 

] [ [ MSLWQ PL. . . VLVLL VLGCCFAA? . RQRQSTLVLF 

MKSLPILL LLCVAVCSAY PLDGAARGED 

[ . . . . . [ [ MMHLAFLV LLCLPVCSAY PLSGAAKEED 

MAPAAWLRSA AARALLPPML LLLLQPPPLL ARALPPDVHH 

M HSFPPLLLLL FWGV.VSHSF PATLETQEQD' 

V [] ' " [ ' ' ' m FSLKTLPFLL LLHVQISKAF P. . VSSKEKN 

[ [ ] , [ , [ MHPGVLAAFL FLSWTHCRAL PLPSGGDEDD 

][[[][..[. \ MKFLLIL LLQATASGAL PLNSSTSLEK 

. ] [ . [ [ [ . MNCQQLWLG FLLPMTVSGR VLGLAEVAPV 

!.!'m£PAPR PSRC LPLLTLGTAL ASLGSAQSSS 

MGSDPSAPGR PGWTGSLLGD REEAARPRLL PLLLVLLGCL GLGVAAEDAE 
MILLTFSTGR RLDFVHHSGV FFLQTLLWIL CATVCGTEQY 



Consensus 
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FIG. 5(c) 
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FIG. 5(d) 
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FIG. 5{e) 
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FIG. 5(i> 
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FIG. 6 
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